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TITLE OF THE INVENTION 

HUMAN N-METHYL-D-ASPARTATE RECEPTOR SUBUNITS. NUCLEIC 
ACIDS ENCODING SAME AND USES THEREFOR 

5 CROSS-REFERENCE TO RELATED APPLICATIONS 

This application is a continuation-in-part of United States Serial No. 
08/052,449, filed April 20, 1993, now pending. 

STATEMENT REGARDING FEDERALLY-SPONSORED R&D 
10 Not applicable. 

REFERENCE TO MICROFICHE APPENDIX 
Not applicable. 

15 FIELD OF THE INVENTION 

The present invention relates to nucleic acids and receptor proteins 
encoded thereby. Invention nucleic acids encode novel human N-methyl-D-aspartate 
(NMDA) receptor subunits. The invention also relates to methods for making such 
receptor subunits and for using the receptor proteins in assays designed to identify and 

20 characterize compounds which affect the function of such receptors, e.g., agonists and 
antagonists of NMDA receptors. 

BACKGROUND OF THE INVENTION 

The amino acid L-glutamate is a major excitatory 

25 neurotransmitter in the mammalian central nervous system. Anatomical, biochemical 
and electrophysiological analyses suggest that glutamatergic systems are involved in a 
broad array of neuronal processes, including fast excitatory synaptic transmission, 
regulation of neurotransmitter releases, long-term potentiation, learning and memory, 
developmental synaptic plasticity, hypoxic-ischemic damage and neuronal cell death, 

30 epileptiform seizures, as well as the pathogenesis of several neurodegenerative 
disorders. See generally, Monaghan et al., Ann. Rev. Pharmacol. Toxicol. 29:365- 
402 (1980). This extensive repertoire of functions, especially those related to 
learning, neurotoxicity and neuropathology, has stimulated recent attempts to describe 
and define the mechanisms throush which elutamate exerts its effects. 
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Currently, glutamate receptor classification schemes are based on 
pharmacological criteria. Glutamate has been observed to mediate its effects through 
receptors that have been categorized into two main groups: lonotropic and 
metabotropic. lonotropic glutamate receptors contain integral cation-specific, ligand- 
5 gated ion channels, whereas metabotropic glutamate receptors are G-protein-coupled 
receptors that transduce extracellular signals via activation of intracellular second 
messenger systems. lonotropic receptors are further divided into at least two 
categories based on the pharmacological and functional properties of the receptors. 
The two main types of ionotropic receptors are N-methyl-D-aspartic acid (NMDA) 

10 and kainic acid (KA)/a-amino-3-hydroxy-5-methyl-isoxazole-4-propionic acid 
(AMPA), formerly called the quisqualic acid, or QUIS, receptor. While the 
metabotropic receptors bind to some of the same ligands that bind to ionotropic 
glutamate receptors, the metabotropic receptors alter synaptic physiology via GTP- 
binding proteins and second messengers such as cyclic AMP, cyclic GMP, 

15 diacylglycerol, inositol 1,4,5-triphosphate and calcium [Gundersen et al., Proc. R. 
Soc. London Ser. 221:127 (1984); Sladeczek et al., Nature 311:717 (1985); Nicoletti 
et al., J. Neurosci. 6:1905 (1986); Sugiyama et al., Nature 325:531 (1987)]. 

The electrophysiological and pharmacological properties of the 
glutamate receptors have been studied using animal tissues and cell lines, as well as 

20 recombinantly produced non-human receptors, as the source of such receptors. The 
value of such studies for application to the development of human therapeutics has 
been limited by the availability of only non-human receptor subunits. Moreover, it is 
only recently that the characteristics and structure of glutamate receptors have been 
investigated at the molecular level. The majority of such investigation has, however, 

25 been carried out in non-human species. Because of the potential physiological and 
pathological significance of glutamate receptors, it would be desirable (for example, 
for drug screening assays) to have available human sequences (i.e., DNA, RNA, 
proteins) which encode representative members of the various glutamate receptor 
subtypes. The availability of such human sequences will also enable the investigation 

30 of receptor distribution in humans, the correlation of specific receptor modification 
with the occurrence of various disease states, etc. 

SUMMARY OF THE INVENTION 

The present invention discloses novel nucleic acids encoding NMDA 
35 receptor protein subunits and the proteins encoded thereby. In a particular 
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embodiment the novel nucleic acids encode NMDAR1 and NMDAR2 subunits of 
human NMDA receptors. More specifically, the invention nucleic acids encode 
NMDAR1. NMDAR2A, NMDAR2B, NMDAR2C and NMDAR2D subunits that 
contribute to the formation of NMDA-activated cation-selective ion channels. In 
5 addition to being useful for the production of NMDA receptor subunit proteins, these 
nucleic acids are also useful as probes, thus enabling those skilled in the art, without 
undue experimentation, to identify and isolate nucleic acids encoding related receptor 
subunits. 

Functional glutamate receptors can be assembled, in accordance with 
10 the present invention, from a plurality of NMDA receptor subunit proteins of one type 
(homomeric) or from combinations of subunit proteins of different types 
(heteromeric). 

In addition to disclosing novel NMDA receptor protein subunits, the 
present invention also comprises methods for using such receptor subunits to identify 
15 and characterize compounds which affect the function of such receptors, e.g., 

agonists, antagonists, and modulators of glutamate receptor function. The invention 
also comprises methods for determining whether unknown protein(s) are functional as 
NMDA receptor subunits. 

20 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a schematic representation of various human NMDAR1 
clones of the invention, with partial restriction maps of each clone. The clones are 
aligned and the differences in the DNAs (i.e., deletions and insertions), relative to 
clone NMDA 10, are indicated. Translation initiation and termination sites are 

25 represented by a "V" and a "*", respectively. Insertions are marked as inverted 
triangles, deletions are indicated by spaces in the boxes. The numbers above the 
insertions and deletions refer to the number of nucleotides inserted or deleted relative 
to NMDA 10. 

Figure 2 is a schematic representation of cDNAs encoding full-length 
30 human NMDAR1 subunit subtypes of the invention, with partial restriction maps of 
each DNA. The full-length cDNAs are constructed by ligation of appropriate portions 
of the clones shown in Figure 1 . Regions of each full-length cDNA composed of 
nucleotide sequences corresponding to a particular clone are distinguished as solid, 
striped, cross-hatched or open boxes. 
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Figure 3 presents the entire nucleotide sequence of construct 
NMDAR1A (see Sequence ID No. 1) with the following information added for ease 
of comparison of the splice variations of the NMDAR1 subunit transcript: low ercase 
letters indicate 5' untranslated sequence and the 3' untranslated sequence of the 
5 NMDAR1 splice variant shown in Sequence ID No. 1 (in some of the other splice 
variants, this 3' untranslated sequence is actually coding sequence); uppercase letters 
indicate coding sequence; the translation initiation codon is identified by the word 
"START" w hereas the three different translation termination codons (TGA) used in 
the different splice variants are identified by small boxes; significant restriction 

10 enzyme sites used in preparing full-length variant constructs are identified by name 
above the sites; the location of a 63-bp insertion (see Sequence ID No. 3) that exists in 
some of the variants is marked as "63 bp INSERT"; the nucleotide sequences that are 
deleted from some of the variants are boxed and labeled as "204 bp DELETION," 
"363 bp DELETION," and "1087 bp DELETION." 

15 Figure 4 is a schematic representation of various human NMDAR2C 

clones of the invention, with partial restriction maps of each clone. The clones are 
aligned and the differences in the DNAs relative to clone NMDA26 are indicated in 
the same manner as done in Figure 1. 

Figure 5 is a schematic representation of full-length human 

20 NMDAR2C subunit subtypes of the invention, with partial restriction maps of each 
DNA. The full-length cDNAs are constructed by ligation of appropriate portions of 
the clones shown in Figure 4. Regions of each full-length cDNA composed of 
nucleotide sequences corresponding to a particular clone are distinguished as solid, 
striped, cross-hatched or open boxes. 

25 Figure 6 presents restriction maps of CMV promoter-based vectors 

pCMV-T7-2 and pCMV-T7-3. 

DETAILED DESCRIPTION OF THE INVENTION 

In accordance with the present invention, there are provided isolated 

30 nucleic acids encoding human N-methyl-D-aspartate (NMDA) receptor subunit(s). In 
one aspect of the present invention, nucleic acids encoding NMDA receptor subunit(s) 
of the NMDAR1 subtype are provided. In another aspect, nucleic acids encoding 
NMDA receptor subunit(s) of the NMDAR2 subtype are provided. In a further 
aspect, eukaryotic cells containing such nucleic acids, and eukaryotic cells expressing 

35 such nucleic acids are provided. 
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Also provided are protein(s) encoded by the above-described nucleic 
acids, as well as antibodies generated against the protein(s). In other aspects of the 
present invention, there are provided nucleic acid probes comprising at least NMDA 
receptor subunit-selective portions of the above-described nucleic acids. 
5 As employed herein, the phrase "human N-methyl-D-aspartate 

(NMDA) receptor subunit(s)" refers to recombinantly produced (i.e., isolated or 
substantially pure) proteins which participate in the formation of a voltage-sensitive 
cation-selective channel activated by exposure to NMDA, and having at least one 
transmembrane domain, a large N-terminal extracellular domain, and the like, 

10 including variants thereof encoded by mRNA generated by alternative splicing of a 
primary transcript, and further including fragments thereof which retain one or more 
of the above properties. 

Use of the phrase "recombinantly produced", "isolated" or 
"substantially pure" in the present specification and claims as a modifier of DNA, 

15 RNA, polypeptides or proteins means that the DNA, RNA, polypeptides or proteins 
so designated have been produced in such form by the hand of man, and thus are 
separated from their native in vivo cellular environment. As a result of this human 
intervention, the recombinant DNAs, RNAs, polypeptides and proteins of the 
invention are useful in ways that the DNAs, RNAs, polypeptides or proteins as they 

20 naturally occur are not, such as identification of selective drugs or compounds. 

The term "functional", when used herein as a modifier of receptor 
protein(s) of the present invention, means that binding of NMDA (or NMDA-like) 
ligand to receptors comprising the protein(s) causes the receptor "ion channels" to 
open. This allows cations, particularly Ca^+, as well as Na+ and K+, to move across 

25 the membrane. Stated another way, "functional" means that a signal is generated as a 
consequence of agonist activation of receptor protein(s). 

As used herein, a splice variant refers to variant NMDA receptor 
subunit-encoding nucleic acid(s) produced by differential processing of primary 
transcript(s) of genomic DNA, resulting in the production of more than one type of 

30 mRNA. cDNA derived from differentially processed primary transcript will encode 
NMDA receptor subunits that have regions of complete amino acid identity and 
regions having different amino acid sequences. Thus, the same genomic sequence can 
lead to the production of multiple, related mRNAs and proteins. Both the resulting 
mRNAs and proteins are referred to herein as "splice variants". 
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Accordingly, also contemplated within the scope of the present 
invention are DNAs that encode NMDA receptor subunits as defined above, but that 
by virtue of degeneracy of the genetic code do not necessarily hybridize to the 
disclosed DNA under specified hybridization conditions. Such subunits also 
5 contribute to the formation of functional receptor, as assessed by methods described 
herein or known to those of skill in the art, with one or more additional NMDA 
receptor subunits of the same or different type (the presence of additional subunits of 
a different type is optional when said subunit is an NMDAR1 subunit). Typically, 
unless an NMDA receptor subunit is encoded by RNA that arises from alternative 

10 splicing (i.e., a splice variant), NMDA receptor subunit-encoding DNA and the 

NMDA receptor subunit encoded thereby share substantial sequence homology with 
at least one of the NMDA receptor subunit DNAs (and proteins encoded thereby) 
described herein. It is understood that DNA or RNA encoding a splice variant may 
share less than 90% overall sequence homology with the DNA or RNA provided 

15 herein, but include regions of nearly 100% homology to a DNA fragment described 
herein, and encode an open reading frame that includes start and stop codons and 
encodes a functional NMDA receptor subunit. 

As employed herein, the phrase "NMDA receptor subunit(s) of the 
NMDAR1 subtype" refers to proteins which, by hydrophobicity analysis of deduced 

20 amino acid sequences, are believed to contain four or more putative transmembrane 
domains, preceded by a large extracellular N-terminal domain. The amino acid 
sequence typically contains possible phosphorylation sites for Ca2+/calmodulin- 
dependent protein kinase type II and protein kinase C [see, for example, Kemp et al. 
(1990) Trends in Biological Science Vol. 15:342-346; Kishimoto et al. (1985) J. Biol. 

25 Chem. Vol. 260:12492-12499; Whittemore et al. (1993) Nature 364:70-73]. (These 
protein kinases reportedly play a crucial role in induction and maintenance of long 
term potentiation.) 

The putative TMII segment (i.e., second transmembrane domain) is 
typically flanked by a glutamic acid residue at the extracellular side and a stretch of 

30 glutamic acid residues at the cytoplasmic side. This segment contains an asparagine 
residue believed to be responsible for high Ca^+ permeability of the NMDAR 
channel. For a summary of NMDAR properties, see Ben-Ari et al., in TINS 15:333- 
339 (1992), especially at p. 334. 

Exemplary DNA sequences encoding human NMDAR 1 subunits are 

35 represented by nucleotides which encode substantially the same amino acid sequence 
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as set forth in Sequence ID Nos. 2, 2E, 2F, 2G, 2H, 21, 2J, 2K, 2L, 2M, 2N, or 2P. 
Presently preferred sequences encode substantially the same amino acid sequence as 
set forth in Sequence ID Nos. 2, 2E, 2F, 2G, 2H, 21 or 2P. 

Exemplary DNA can alternatively be characterized as those nucleotide 
5 sequences which encode a human NMDAR1 subunit and hybridize under high 
stringency conditions to substantially the entire sequence of any one of Sequence ID 
Nos. 1, 1A, IB, 1C, ID, IE, IF, 1G, 1H, II, 1J, IK, 1L, 1M, IN, or IP, or substantial 
portions thereof (i.e., typically at least 25-30 nucleotides thereof); preferably 
exemplary DNA will hybridize under high stringency conditions to substantially the 
10 entire sequence of any one of Sequence ID Nos. 1, IE, IF, 1G, 1H, II or IP, or 
substantial portions thereof. 

Stringency of hybridization is used herein to refer to conditions under 
which polynucleic acid hybrids are stable. As known to those of skill in the art, the 
stability of hybrids is reflected in the melting temperature (T m ) of the hybrids. T m 

15 can be approximated by the formula: 

81.5°C - 16.6(logio[Na+]) + 0.41 (%G+C) - 600/1, 

where 1 is the length of the hybrids in nucleotides. T m decreases approximately 
1-1.5°C with every 1% decrease in sequence homology. In general, the stability of a 
hybrid is a function of sodium ion concentration and temperature. Typically, the 
hybridization reaction is performed under conditions of lower stringency, followed by 
washes of varying, but higher, stringency. Reference to hybridization stringency 
relates to such washing conditions. Thus, as used herein: 

(1) HIGH STRINGENCY conditions, with respect to fragment 

hybridization, refers to conditions that permit hybridization of 
only those nucleic acid sequences that form stable hybrids in 
0.018M NaCl at 65°C (i.e., if a hybrid is not stable in 0.018M 
30 NaCl at 65°C, it will not be stable under high stringency 

conditions, as contemplated herein). High stringency 
conditions can be provided, for example, by hybridization in 
507c formamide, 5X Denharts solution, 5X SSPE, 0.27c SDS at 
42°C, followed by washing in 0.1 X SSPE, and 0.1 7c SDS at 
35 65°C; 
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(2) MODERATE STRINGENCY conditions, with respect to 
fragment hybridization, refers to conditions equivalent to 
hybridization in 507c formamide, 5X Denhart s solution, 5X 
SSPE, 0.2 r f SDS at 42°C, followed by washing in 0.2X SSPE, 

5 0.27c SDS, at 65°C; 

(3) LOW STRINGENCY conditions, with respect to fragment 
hybridization, refers to conditions equivalent to hybridization in 
10% formamide, 5X Denhart s solution, 6X SSPE, 0.27c SDS at 
42°C, followed by washing in IX SSPE, 0.27c SDS, at 50°C; 

10 and 

(4) HIGH STRINGENCY conditions, with respect to 
oligonucleotide (i.e., synthetic DNA < about 30 nucleotides in 
length) hybridization, refers to conditions equivalent to 
hybridization in 10% formamide, 5X Denhart s solution, 6X 

15 SSPE, 0.2% SDS at 42°C, followed by washing in IX SSPE, 

and 0.2% SDS at 50°C. 

It is understood that these conditions may be duplicated using a variety of buffers and 
temperatures and that they are not necessarily precise. 

20 Denhart's solution and SSPE (see, e.g., Sambrook, Fritsch, and 

Maniatis, in: Molecular Cloning, A Laboratory Manual , Cold Spring Harbor 
Laboratory Press, 1989) are well known to those of skill in the art as are other suitable 
hybridization buffers. For example, SSPE is pH 7.4 phosphate-buffered 0.1 8M NaCl. 
SSPE can be prepared, for example, as a 20X stock solution by dissolving 175.3 g of 

25 NaCl, 27.6 g of NaH2P04 and 7.4 g EDTA in 800 ml of water, adjusting the pH to 

7.4, and then adding water to 1 liter. Denhart s solution (see, Denhart (1966) 
Biochem. Biophys. Res. Commun. 23:641 ) can be prepared, for example, as a SOX 
stock solution by mixing 5 g Ficoll (Type 400, Pharmacia LKB Biotechnology, INC., 
Piscataway, NJ ), 5 g of polyvinylpyrrolidone, 5 g bovine serum albumin (Fraction V; 

30 Sigma, St. Louis, MO) water to 500 ml and filtering to remove particulate matter. 

Especially preferred sequences are those which have substantially the 
same nucleotide sequence as the coding sequences in any one of Sequence ID Nos. 1, 
IE, IF, 1G, 1H, II, 1J, IK, 1L, 1M, IN, or IP; with those having substantially the 
same sequence as the coding sequence in Sequence ID Nos. 1, IE, IF, 1G, 1H, II or 

35 IP being most preferred. 
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As used herein, the phrase "substantial sequence homology" refers to 
nucleotide sequences which share at least about 909f identity, and amino acid 
sequences which typically share more than 957c amino acid identity (>997c amino 
acid identity when dealing with NMDAR1 subunits). It is recognized, however, that 
5 proteins (and DNA or mRNA encoding such proteins) containing less than the above- 
described level of homology arising as splice variants or that are modified by 
conservative amino acid substitutions (or substitution of degenerate codons) are 
contemplated to be within the scope of the present invention. 

As used herein, the phrase "substantially the same" refers to the 

10 nucleotide sequences of DNA, the ribonucleotide sequences of RNA, or the amino 
acid sequences of protein, that have slight and non-consequential sequence variations 
from the actual sequences disclosed herein. Species that are "substantially the same" 
are considered to be equivalent to the disclosed sequences, and as such are within the 
scope of the appended claims. In this regard, "slight and non-consequential sequence 

15 variations" mean that sequences that are substantially the same as the DNA, RNA, or 
proteins disclosed and claimed herein, are functionally equivalent to the human- 
derived sequences disclosed and claimed herein. Functionally equivalent sequences 
will function in substantially the same manner to produce substantially the same 
compositions as the human-derived nucleic acid and amino acid compositions 

20 disclosed and claimed herein. In particular, functionally equivalent DNAs encode 
human-derived proteins that are the same as those disclosed herein or that have 
conservative amino acid variations, such as substitution of a non-polar residue for 
another non-polar residue or a charged residue for a similarly charged residue. These 
changes include those recognized by those of skill in the art as those that do not 

25 substantially alter the tertiary structure of the protein. 

As employed herein, the phrase "NMDA receptor subunit(s) of the 
NMDAR2 subtype" refers to proteins which have a large putative extracellular 
domain at the amino-terminal region. Otherwise, the deduced structure of NMDAR2 
subunits displays the same general characteristics as the NMDAR1 subunit structure. 

30 A notable typical exception is that the negatively charged glutamic acid residues that 
are generally present in the putative TMII segment of NMDAR1 subunits are 
generally absent from the TMII segment of NMDAR2. Instead, NMDAR2 subunits 
may contain a positively charged lysine residue in TMII. Unlike NMDAR1 subunits. 
NMDAR2 subunits generally do not form homomeric NMDA receptors. Moreover, 
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the amino acid sequences of NMDAR1 and NMDAR2 subunits are generally less 
than 50 c /c identical, with identities of less than 307c typically observed. 

NMDAR2 subunits contemplated by the present invention include 
NMDAR2A, NMDAR2B, NMDAR2C and NMDAR2D types of subunits. 
5 Exemplary DNA sequences encoding human NMDAR2A subunits, or portions 
thereof, are represented by nucleotides which encode substantially the same amino 
acid sequence as set forth in Sequence ID No. 1 1, or substantially the same amino 
acid sequence as that encoded by the NMDAR2A-encoding portion of clone 
NMDA57, deposited with the ATCC under accession number 75442. 

10 The deposited clone has been deposited at the American Type Culture 

Collection (ATCC), 12301 Parklawn Drive, Rockville, Maryland, U.S.A. 20852, 
under the terms of the Budapest Treaty on the International Recognition of Deposits 
of Microorganisms for Purposes of Patent Procedure and the Regulations promulgated 
under this Treaty. Samples of the deposited material are and will be available to 

15 industrial property offices and other persons legally entitled to receive them under the 
terms of the Treaty and Regulations and otherwise in compliance with the patent laws 
and regulations of the United States of America and all other nations or international 
organizations in which this application, or an application claiming priority of this 
application, is filed or in which any patent granted on any such application is granted. 

20 In particular, upon issuance of a U.S. patent based on this or any application claiming 
priority to or incorporating this application by reference thereto, all restriction upon 
availability of the deposited material will be irrevocably removed. 

Exemplary human NMDAR2A subunit-encoding DNAs can 
alternatively be characterized as those nucleotide sequences which hybridize under 

25 high stringency conditions to substantially the entire sequence of Sequence ID No. 10, 
or substantial portions thereof (i.e., typically at least 25-30 nucleotides thereof), or the 
NMDAR2A-encoding portion of clone NMDA57 (ATCC accession No. 75442). 
Especially preferred sequences encoding human NMDAR2A subunits are those which 
have substantially the same nucleotide sequence as the coding sequence of Sequence 

30 ID No. 10, or those which contain substantially the same nucleotide sequence as the 
coding sequence in the NMDAR2A-encoding portion of clone NMDA57. 

Exemplary DNA sequences encoding human NMDAR2B subunits are 
represented by nucleotides which encode substantially the same amino acid sequence 
as set forth in Sequence ID No. 14. Exemplary DNAs can alternatively be 

35 characterized as those nucleotide sequences which encode a human NMDAR2B 
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subunit and hybnclize under high stringency conditions to substantially the entire 
sequence of Sequence ID No. 13, or substantial portions thereof (i.e., typically at least 
25-30 nucleotides thereof). Especially preferred NMDAR2B-encoding sequences are 
those which have substantially the same nucleotide sequence as the coding sequence 
5 in Sequence ID No. 13. 

Exemplary DNA sequences encoding human NMDAR2C subunits are 
represented by nucleotides which encode substantially the same amino acid sequence 
as set forth in Sequence ID Nos. 6, 6E, 6F, 6G, 6H or 61. 

Exemplary DNAs can alternatively be characterized as those 
10 nucleotide sequences which encode a human NMDAR2C subunit and hybridize under 
high stringency conditions to substantially the entire sequence of any one of Sequence 
ID Nos. 5, 5A, 5B, 5C, 5D, 5E, 5F, 5G, 5H, or 51, or substantial portions thereof (i.e., 
typically at least 25-30 nucleotides thereof); preferably exemplary DNA will 
hybridize under high stringency conditions to substantially the entire sequence of any 
15 one of Sequence ED Nos. 5, 5E, 5F, or 5G, or substantial portions thereof. 

Especially preferred NMDAR2C-encoding sequences are those which 
have substantially the same nucleotide sequence as the coding sequences in any one 
of Sequence ID Nos. 5, 5E, 5F, 5G, 5H or 51; with those having substantially the same 
sequence as the coding sequences in Sequence ID Nos. 5, 5E, 5F, or 5G being most 
20 preferred. 

Exemplary DNA sequences encoding human NMDAR2D subunits are 
represented by nucleotides which encode substantially the same amino acid sequence 
as set forth in Sequence ID No. 16. Exemplary DNAs can alternatively be 
characterized as those nucleotide sequences which encode a human NMDAR2D 

25 subunit and hybridize under high stringency conditions to substantially the entire 

sequence of Sequence ID No. 15, or substantial portions thereof (i.e., typically at least 
25-30 nucleotides thereof)- Especially preferred NMDAR2D-encoding sequences are 
those which have substantially the same nucleotide sequence as the coding sequence 
in Sequence ID No. 15. 

30 DNA encoding human NMDA receptor subunits may be isolated by 

screening suitable human cDNA or human genomic libraries under suitable 
hybridization conditions with DNA disclosed herein (including nucleotides derived 
from any of SEQ ID Nos. 1, 1A-1P, 5, 5A-5I, 10, 13 or 15). Suitable libraries can be 
prepared from neuronal tissue samples, e.g., hippocampus and cerebellum tissue, cell 

35 lines, and the like. For example, the library can be screened with a portion of DNA 
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including substantially the entire subunit-encoding sequence thereof, or the library 
may be screened with a suitable probe. 

As used herein, a probe is single-stranded DNA or RNA that has a 
sequence of nucleotides that includes at least 14 contiguous bases that are the same as 
5 (or the complement of) any 14 or more contiguous bases set forth in any of SEQ ID 
Nos. 1, 1A-1P, 5, 5A-5I, 10, 13 or 15. Preferred regions from which to construct 
probes include 5' and/or 3' coding sequences, sequences predicted to encode 
transmembrane domains, sequences predicted to encode cytoplasmic loops, signal 
sequences, NMDA binding sites, and the like. 

10 Either the full-length cDNA clones or fragments thereof can be used as 

probes, preferably labeled with suitable label means for ready detection. When 
fragments are used as probes, preferably the DNA sequences will be from the 
carboxyl end-encoding portion of the DNA, and most preferably will include 
predicted transmembrane domain-encoding portions of the DNA sequence (the 

15 domains can be predicted based on hydropathy analysis of the deduced amino acid 
sequence using, for example, the method of Kyte and Doolittle (1982), J. Mol . Biol . 
Vol. 157 :105). These probes can be used, for example, for the identification and 
isolation of additional members of the glutamate receptor family. 

As a particular application of the invention sequences, genetic 

20 screening can be carried out using the nucleotide sequences of the invention as 

probes. Thus, nucleic acid samples from patients having neuropathological conditions 
suspected of involving alteration/modification of any one or more of the glutamate 
receptors can be screened with appropriate probes to determine if any abnormalities 
exist with respect to any of the endogenous glutamate receptors. Similarly, patients 

25 having a family history of disease states related to glutamate receptor dysfunction can 
be screened to determine if they are also predisposed to such disease states. 

In accordance with another embodiment of the present invention, there 
is provided a method for identifying DNA encoding human N-methyl-D-aspartate 
(NMDA) receptor protein subunit(s), said method comprising: 

30 contacting human DNA with a nucleic acid probe as described above, 

wherein said contacting is carried out under high stringency hybridization conditions, 
and 

identifying DNA(s) which hybridize to said probe. 
After screening the library, positive clones are identified by detecting a 
35 hybridization signal; the identified clones are characterized by restriction enzyme 
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mapping and/or DNA sequence analysis, and then examined, by comparison with the 
sequences set forth herein to ascertain whether they include DNA encoding a 
complete NMDA receptor subunit (i.e., if they include translation initiation and 
termination codons). If the selected clones are incomplete, they may be used to 
5 rescreen the same or a different library to obtain overlapping clones. If the library is 
genomic, then the overlapping clones may include exons and introns. If the library is a 
cDNA library, then the overlapping clones will include an open reading frame. In 
both instances, complete clones may be identified by comparison with the DNA and 
encoded proteins provided herein. 

10 Complementary DNA clones encoding various human NMDA receptor 

subunits (e.g., NMDAR1, NMDAR2A, NMDAR2B, NMDAR2C, NMDAR2D) have 
been isolated. Each type of subunit appears to be encoded by a different gene. The 
DNA clones provided herein may be used to isolate genomic clones encoding each 
type of subunit and to isolate any splice variants by screening libraries prepared from 

15 different neural tissues. Nucleic acid amplification techniques, which are well known 
in the art, can be used to locate DNA encoding splice variants of human NMDA 
receptor subunits. This is accomplished by employing oligonucleotides based on 
DNA sequences surrounding divergent sequence(s) as primers for amplifying human 
RNA or genomic DNA. Size and sequence determinations of the amplification 

20 products can reveal the existence of splice variants. Furthermore, isolation of human 
genomic DNA sequences by hybridization can yield DNA containing multiple exons, 
separated by introns, that correspond to different splice variants of transcripts 
encoding human NMDA receptor subunits. 

It has been found that not all subunits (and variants thereof) are 

25 expressed in all neural tissues or in all portions of the brain. Thus, in order to isolate 
cDNA encoding a particular subunit or splice variants thereof, it is preferable to 
screen libraries prepared from different neuronal or neural tissues. Preferred tissues 
to use as sources of nucleic acids for preparing libraries to obtain DNA encoding each 
subunit include: hippocampus to isolate human NMDAR1 -encoding DNAs; 

30 hippocampus, cerebellum and fetal brain to isolate NMDAR2-encoding DNAs; and 
the like. 

Once DNA encoding a subunit has been isolated, ribonuclease (RNase) 
protection assays can be employed to determine which tissues express mRNA 
encoding a particular NMDAR subunit subtype or variant. These assays provide a 
35 sensitive means for detecting and quantitating an RNA species in a complex mixture 
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of total cellular RNA. The subunit DNA is labeled and hybridized with cellular RNA. 
If complementary mRNA is present in the cellular RNA, a DNA-RNA hybrid results. 
The RNA sample is then treated with RNase, which degrades single-stranded RNA. 
Any RNA-DNA hybrids are protected from RNase degradation and can be visualed 
5 by gel electrophorsis and autoradiography. In situ hybridization techniques can also 
be used to determine which tissues express mRNA encoding a particular NMDAR 
subunit. The labeled subunit DNAs are hybridized to different brain region slices to 
visualize subunit mRNA expression. 

The distribution of expression of some human NMDA receptor 

10 subunits may differ from the distribution of such receptors in rat. For example, RNA 
encoding the rat NMDAR2C subunit is abundant in rat cerebellum, but is not 
abundant in rat hippocampus [see, e.g., Monyer et al., Science 256:1217-1221 
(1992)]. Numerous human NMDAR2C clones were ultimately obtained, however, 
from a human hippocampus library. Thus, the distribution of some NMDA receptor 

15 subunits in humans and rats appears to be different. 

The above-described nucleotide sequences can be incorporated into 
vectors for further manipulation. As used herein, vector (or plasmid) refers to discrete 
elements that are used to introduce heterologous DNA into cells for either expression 
or replication thereof. Selection and use of such vehicles are well within the skill of 

20 the artisan. 

An expression vector includes vectors capable of expressing DNAs 
that are operatively linked with regulatory sequences, such as promoter regions, that 
are capable of effecting expression of such DNA fragments. Thus, an expression 
vector refers to a recombinant DNA or RNA construct, such as a plasmid, a phage, 

25 recombinant virus or other vector that, upon introduction into an appropriate host cell, 
results in expression of the cloned DNA. Appropriate expression vectors are well 
known to those of skill in the art and include those that are replicable in eukaryotic 
cells and/or prokaryotic cells and those that remain episomal or those which integrate 
into the host cell genome. Presently preferred plasmids for expression of invention 

30 NMDA receptor subunits in eukaryotic host cells, particularly mammalian cells, 

include cytomegalovirus (CMV) promoter-containing vectors such as pCMV-T7-2 or 
pCMV-T7-3 (see Figure 6), pMMTVT7(+) or pMMTVT7(-) (modified versions of 
pMAMneo (Clontech, Palo Alto, CA), prepared as described herein), pcDNAl, and 
the like. 
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As used herein, a promoter region refers to a segment of DNA that 
controls transcription of DNA to which it is operatively linked. The promoter region 
includes specific sequences that are sufficient for RNA polymerase recognition, 
binding and transcription initiation. This portion of the promoter region is referred to 
5 as the promoter. In addition, the promoter region includes sequences that modulate 
this recognition, binding and transcription initiation activity of RNA polymerase. 
These sequences may be cis acting or may be responsive to trans acting factors. 
Promoters, depending upon the nature of the regulation, may be constitutive or 
regulated. Exemplary promoters contemplated for use in the practice of the present 

10 invention include the SV40 early promoter, the cytomegalovirus (CMV) promoter, the 
mouse mammary tumor virus (MMTV) steroid-inducible promoter, Moloney murine 
leukemia virus (MMLV) promoter, and the like. 

As used herein, the term "operatively linked" refers to the functional 
relationship of DNA with regulatory and effector sequences of nucleotides, such as 

15 promoters, enhancers, transcriptional and translational stop sites, and other signal 
sequences. For example, operative linkage of DNA to a promoter refers to the 
physical and functional relationship between the DNA and the promoter such that the 
transcription of such DNA is initiated from the promoter by an RNA polymerase that 
specifically recognizes and binds to the promoter, and transcribes the DNA. In order 

20 to optimize expression and/or in vitro transcription, it may be necessary to remove, 
add or alter 5' and/or 3' untranslated portions of the clones to eliminate extra, potential 
inappropriate alternative translation initiation (i.e., start) codons or other sequences 
that may interfere with or reduce expression, either at the level of transcription or 
translation. Alternatively, consensus ribosome binding sites (see, for example, Kozak 

25 (1991) J. Biol. Chem. 266:19867-19870) can be inserted immediately 5' of the start 
codon and may enhance expression. Likewise, alternative codons, encoding the same 
amino acid, can be substituted for coding sequences of the NMDAR subunits in order 
to enhance transcription (e.g., the codon preference of the host cells can be adopted, 
the presence of G-C rich domains can be reduced, and the like). Furthermore, for 

30 potentially enhanced expression of NMDA receptor subunits in amphibian oocytes, 
the subunit coding sequence can optionally be incorporated into an expression 
construct wherein the 5 - and 3 -ends of the coding sequence are contiguous with 
Xenopus (3-globin gene 5' and 3 1 untranslated sequences, respectively. For example, 
NMDA receptor subunit coding sequences can be incorporated into vector pSP64T 

35 (see Krieg and Melton (1984) in Nucleic Acids Research 12:7057-7070), a modified 
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form of pSP64 (available from Promega, Madison, WI). The coding sequence is 
inserted between the 5' end of the (3-globin gene and the 3* untranslated sequences 
located downstream of the SP6 promoter. In vitro transcripts can then be generated 
from teh resulting vector. The desirability of (or need for) such modification may be 
5 empirically determined. 

As used herein, expression refers to the process by which polynucleic 
acids are transcribed into mRNA and translated into peptides, polypeptides, or 
proteins. If the polynucleic acid is derived from genomic DNA, expression may, if an 
appropriate eukaryotic host cell or organism is selected, include splicing of the 
10 mRNA. 

Particularly preferred vectors for transfection of mammalian cells are 
the pSV2dhfr expression vectors, which contain the SV40 early promoter, mouse dhfr 
gene, SV40 polyadenylation and splice sites and sequences necessary for maintaining 
the vector in bacteria, cytomegalovirus (CMV) promoter-based vectors such as 

15 pCMV-T7-2 and pCMV-T7-3 (described herein) or pCDNAl (Invitrogen, San Diego, 
CA), and MMTV promoter-based vectors such as pMMTVT7(+) or pMMTVT7(-), 
described herein. 

Full-length DNAs encoding human NMDA receptor subunits have 
been inserted into vectors pcDNAl, pMMTVT7(+), pCMV-T7-2 and pCMV-T7-3. 

20 pCMV-T7-2 is a pUC19-based mammalian cell expression vector containing the 
CMV promoter/enhancer, SV40 splice/donor sites located immediately downstream 
of the promoter, a T7 bacteriophage RNA polymerase promoter positioned 
downstream of the splice sites, followed by an SV40 polyadenylation signal and a 
polylinker between the T7 promoter and the polyadenylation signal. Placement of 

25 NMDA receptor subunit DNA between the CMV promoter and SV40 polyadenylation 
signal should provide for constitutive expression of the foreign DNA in a mammalian 
host cell transfected with the construct. Plasmid pCMV-T7-3 is identical to pCMV- 
T7-2 except that the order of restriction enzyme sites in the polylinker is reversed. 

Vectors pMMTVT7(+) and pMMTVT7(-) were prepared by modifying 

30 vector pMAMneo (Clontech, Palo Alto, CA). pMAMneo is a mammalian expression 
vector that contains the Rous Sarcoma Virus (RSV) long terminal repeat (LTR) 
enhancer, linked to the dexamethasone-inducible mouse mammary tumor virus 
(MMTV)-LTR promoter, followed by SV40 splicing and polyadenylation sites. 
pMAMneo also contains the E. coli neo gene for selection of transformants, as well as 
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the (3-lactamase gene (encoding a protein which imparts ampicillin-resistance) for 
propagation in E. coli. 

Vector pMMTVT7(+) can be generated by modification of pMAMneo 
to remove the two gene and insert the multiple cloning site and T7 and T3 promoters 
5 from pBluescript (Stratagene, La Jolla, CA). Thus, pMMTVT7(+) contains the 
RSV-LTR enhancer linked to the MMTV-LTR promoter, a T7 bacteriophage RNA 
polymerase promoter positioned downstream of the MMTV-LTR promoter, a 
polylinker positioned downstream of the T7 promoter, a T3 bacteriophage RNA 
polymerase promoter positioned downstream of the T7 promoter, and SV40 splicing 

10 and polyadenylation sites positioned downstream of the T3 promoter. The P- 
lactamase gene (encoding a protein which imparts ampicillin-resistance) from 
pMAMneo is retained in pMMTVT7(+), although it is incorporated in the reverse 
orientation relative to the orientation in pMAMneo. 

Vector pMMTVT7(-) is identical to pMMTVT7(+) except that the 

15 positions of the T7 and T3 promoters are switched, i.e., the T3 promoter in 

pMMTVT7(-) is located where the T7 promoter is located in pMMTVT7(+), and the 
T7 promoter in pMMTVT7(-) is located where the T3 promoter is located in 
pMMTVT7<+). Therefore, vectors pMMTVT7(+) and pMMTVT7(-) contain all of 
the regulatory elements required for expression of heterologous DNA in a mammalian 

20 host cell, wherein the heterologous DNA has been incorporated into the vectors at the 
polylinker. In addition, because the T7 and T3 promoters are located on either side of 
the polylinker, these plasmids can be used for synthesis of in vitro transcripts of 
heterologous DNA that has been subcloned into the vectors at the polylinker. 

For inducible expression of human NMDA receptor subunit-encoding 

25 DNA in a mammalian cell, the DNA can be inserted into a plasmid such as 

pMMTVT7(+) or pMMTVT7(-). These plasmids contain the mouse mammary tumor 
virus (MMTV) promoter for steroid-inducible expression of operatively associated 
foreign DNA. If the host cell does not express endogenous glucocorticoid receptors 
required for uptake of glucocorticoids (i.e., inducers of the MMTV promoter) into the 

30 cell, it is necessary to additionally transfect the cell with DNA encoding the 
glucocorticoid receptor (ATCC accession no. 67200). For synthesis of in vitro 
transcripts, full-length human DNA clones encoding human NMDAR1, NMDAR2A, 
NMDAR2B, NMDAR2C and NMDAR2D can also be subcloned into pIBI24 
(International Biotechnologies, Inc., New Haven, CT), pCMV-T7-2, pCMV-T7-3, 
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pMMTVT7(+), pMMTVT7(-), pBluescnpt (Stratagene, La Jolla, CA) or pGEM7Z 
(Promega, Madison, WI). 

In accordance with another embodiment of the present invention, there 
are provided cells containing the above-described polynucleic acids (i.e., DNA or 
5 mRNA). Such host cells as bacterial, yeast and mammalian cells can be used for 
replicating DNA and producing NMDA receptor subunit(s). Methods for assessing 
receptor expression and function are described in PCT Application Nos. 
PCT/US91/05625 and PCT/US92/1 1090, and in co-pending U.S. Application Serial 
Nos. 07/563,751 and 07/812,254. The subject matter of these documents is hereby 
10 incorporated by reference herein in their entirety. 

Incorporation of cloned DNA into a suitable expression vector, 
transfection of eukaryotic cells with a plasmid vector or a combination of plasmid 
vectors, each encoding one or more distinct genes or with linear DNA, and selection 
of transfected cells are well known in the art (see, e.g., Sambrook et al. (1989) 
15 Molecular Cloning: A Laboratory Manual , Second Edition, Cold Spring Harbor 
Laboratory Press). Heterologous DNA may be introduced into host cells by any 
method known to those of skill in the art, such as transfection with a vector encoding 
the heterologous DNA by CaP04 precipitation (see, e.g., Wigler et al. (1979) Proc. 

Natl. Acad. Sci. 76:1373-1376) or Hpofectamine (GIBCO BRL #18324-012). 

20 Recombinant cells can then be cultured under conditions whereby the subunit(s) 
encoded by the DNA is (are) expressed. Preferred cells include mammalian cells 
(e.g., HEK293, CHO, BHKBI and Ltk" cells, mouse monocyte macrophage P388D1 
and J774A-1 cells (available from ATCC, Rockville, MD), and the like), yeast cells 
(e.g., methylotrophic yeast cells, such as Pichia pastoris), bacterial cells (e.g., 

25 Escherichia coli), and the like. 

While the DNA provided herein may be expressed in any eukaryotic 
cell, including yeast cells (such as, for example, P. pastoris (see U.S. Patent Nos. 
4,882,279, 4,837,148, 4,929,555 and 4,855,231), Saccharomyces cerevisiae, Candida 
tropicalis, Hansemda polymorpha, and the like), mammalian expression systems, 

30 including commercially available systems and other such systems known to those of 
skill in the art, for expression of DNA encoding the human NMDA receptor subunits 
provided herein are presently preferred. Xenopus oocytes are preferred for expression 
of in vitro RNA transcripts of the DNA. 

In preferred embodiments, human NMDAR subunit-encoding DNA is 

35 ligated into a vector, and introduced into suitable host cells to produce transformed 
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cell lines that express a specific human NMDA receptor subtype, or specific 
combinations of subunits. The resulting cell lines can then be produced in quantity 
for reproducible quantitative analysis of the effects of known or potential drugs on 
receptor function. In other embodiments, mRNA may be produced by in vitro 
5 transcription of DNA encoding each subunit. This mRNA, either from a single 
subunit clone or from a combination of clones, can then be injected into Xenopus 
oocytes where the mRNA directs the synthesis of the human receptor subunits, which 
then form functional receptors. Alternatively, the subunit-encoding DNA can be 
directly injected into oocytes for expression of functional receptors. The transfected 

10 mammalian cells or injected oocytes may then be used in the methods of drug 
screening provided herein. 

Eukaryotic cells in which DNA or RNA may be introduced include 
any cells that are transfectable by such DNA or RNA or into which such DNA or 
RNA may be injected. Preferred cells are those that can be transiently or stably 

15 transfected and also express the DNA and RNA. Presently most preferred cells are 
those that can form recombinant or heterologous human NMDA receptors comprising 
one or more subunits encoded by the heterologous DNA. Such cells may be 
identified empirically or selected from among those known to be readily transfected 
or injected. 

20 Exemplary cells for introducing DNA include cells of mammalian 

origin (e.g., COS cells, mouse L cells, Chinese hamster ovary (CHO) cells, human 
embryonic kidney (HEK) cells (particularly HEK293 cells that can be frozen in liquid 
nitrogen and then thawed and regrown; for example, those described in U.S. Patent 
No. 5,024,939 to Gorman (see, also, Stillman et al. (1985) Mol. Cell. Biol. 5:2051- 

25 2060)), African green monkey cells and other such cells known to those of skill in the 
art), amphibian cells (e.g., Xenopus laevis oocytes), yeast cells (e.g., Saccharomyces 
cerevisiae, Pichia pastoris), and the like. Exemplary cells for expressing injected 
RNA transcripts include Xenopus laevis oocytes. Cells that are preferred for 
transfection of DNA are known to those of skill in the art or may be empirically 

30 identified, and include HEK293 (which are available from ATCC under accession 
#CRL 1573); Ltk~ cells ( which are available from ATCC under accession #CCL1.3); 
COS-7 cells (which are available from ATCC under accession #CRL 1651); and 
DG44 cells (dhfr CHO cells; see, e.g., Urlaub et al. (1986) Cell. Molec. Genet. 12: 
555). Presently preferred cells include Ltk~ cells and DG44 cells. 
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DNA may be stably incorporated into cells or may be transiently 
expressed using methods known in the art. Stably transfected mammalian cells may 
be prepared by transfecting cells with an expression vector having a selectable marker 
gene (such as, for example, the gene for thymidine kinase, dihydrofolate reductase, 
5 neomycin resistance, and the like), and growing the transfected cells under conditions 
selective for cells expressing the marker gene. To prepare transient transfectants, 
mammalian cells are transfected with a reporter gene (such as the E. coli 6- 
galactosidase gene) to monitor transfection efficiency. Selectable marker genes are 
not included in the transient transfections because the transfectants are typically not 
10 grown under selective conditions, and are usually analyzed within a few days after 
transfection. 

To produce such stably or transiently transfected cells, the cells should 
be transfected with a sufficient concentration of subunit-encoding nucleic acids to 
form human NMDA receptors that contain the human subunits encoded by 
15 heterologous DNA. The precise amounts and ratios of DNA encoding the subunits 
may be empirically determined and optimized for a particular combination of 
subunits, cells and assay conditions. Recombinant cells that express NMDA receptors 
containing subunits encoded only by the heterologous DNA or RNA are especially 
preferred. 

20 Heterologous DNA may be maintained in the cell as an episomal 

element or may be integrated into chromosomal DNA of the cell. The resulting 
recombinant cells may then be cultured or subcultured (or passaged, in the case of 
mammalian cells) from such a culture or a subculture thereof. Methods for 
transfection, injection and culturing recombinant cells are known to the skilled 

25 artisan. Similarly, the human NMDA receptor subunits may be purified using protein 
purification methods known to those of skill in the art. For example, antibodies or 
other ligands that specifically bind to one or more of the subunits may be used for 
affinity purification and immunoprecipitation of the subunit or human NMDA 
receptors containing the subunits. 

30 As used herein, heterologous or foreign DNA and RNA are used 

interchangeably and refer to DNA or RNA that does not occur naturally as part of the 
genome of the cell in which it is present or to DNA or RNA which is found in a 
location or locations in the genome that differ from that in which it occurs in nature. 
Typically, heterologous or foreign DNA and RNA refers to DNA or RNA that is not 

35 endogenous to the host cell and has been artificially introduced into the cell. 
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Examples of heterologous DNA include DNA that encodes a human NMDA receptor 
subunit, DNA that encodes RNA or proteins that mediate or alter expression of 
endogenous DNA by affecting transcription, translation, or other regulatable 
biochemical processes, and the like. The cell that expresses heterologous DNA may 
5 contain DNA encoding the same or different expression products. Heterologous DNA 
need not be expressed and may be integrated into the host cell genome or maintained 
episomally. 

Recombinant receptors on recombinant eukaryotic cell surfaces may 
contain one or more subunits encoded by the DNA or mRNA encoding human 

10 NMDA receptor subunits, or may contain a mixture of subunits encoded by the host 
cell and subunits encoded by heterologous DNA or mRNA. Recombinant receptors 
may be homomeric or may be a heteromeric combination of multiple subunits. 
Mixtures of DNA or mRNA encoding receptors from various species, such as rats and 
humans, may also be introduced into the cells. Thus, a cell can be prepared that 

15 expresses recombinant receptors containing only NMDAR1 subunits, or a 

combination of any one or more NMDAR1 and any one or more NMDAR2 subunits 
provided herein. For example, NMDAR1 subunits of the present invention can be co- 
expressed with NMDAR2A, NMDAR2B, NMDAR2C and/or NMDAR2D receptor 
subunits. Specific examples of heteromeric combinations of recombinant human 

20 NMDAR subunits that have been expressed in Xenopus oocytes include NMDAR1 + 
NMDAR2A, NMDAR 1 + NMDAR2B, and NMDAR 1 + NMDAR2A + NMDAR2C 
(see Example 9). 

The DNA, mRNA, vectors, receptor subunits, receptor subunit 
combinations and cells provided herein permit production of selected NMDA receptor 

25 subunits and specific combinations thereof, as well as antibodies to said receptor 
subunits. This provides a means to prepare synthetic or recombinant receptors and 
receptor subunits that are substantially free of contamination from many other 
receptor proteins whose presence can interfere with analysis of a single NMDA 
receptor subtype. The availability of desired receptor subtypes makes it possible to 

30 observe the effect of a drug substance on a particular receptor subtype or combination 
of NMDA receptor subunits, and to thereby perform initial in vitro screening of the 
drug substance in a test system that is specific for humans and specific for a human 
NMDA receptor subtype or combination of NMDA receptor subunits. The 
availability of specific antibodies makes it possible to identify the subunit 
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combinations expressed in vivo. Such specific combinations can then be employed as 
preferred targets in drug screening. 

The ability to screen drug substances /'// vitro to determine the effect of 
the drug on specific receptor compositions should permit the development and 
5 screening of receptor subtype-specific or disease-specific drugs. Also, testing of 
single receptor subunits or specific combinations of various types of receptor subunits 
with a variety of potential agonists or antagonists provides additional information 
with respect to the function and activity of the individual subunits and should lead to 
the identification and design of compounds that are capable of very specific 

10 interaction with one or more types of receptor subunits or receptor subtypes. The 
resulting drugs should exhibit fewer unwanted side effects than drugs identified by 
screening with cells that express a variety of receptor subtypes. 

Further in relation to drug development and therapeutic treatment of 
various disease states, the availability of DNAs encoding human NMDA receptor 

15 subunits enables identification of any alterations in such genes (e.g., mutations) which 
may correlate with the occurrence of certain disease states. In addition, the creation 
of animal models of such disease states becomes possible, by specifically introducing 
such mutations into synthetic DNA sequences which can then be introduced into 
laboratory animals or in vitro assay systems to determine the effects thereof. 

20 In another aspect, the invention comprises functional peptide 

fragments, and functional combinations thereof, encoded by the DNAs of the 
invention. Such functional peptide fragments can be produced by those skilled in the 
art, without undue experimentation, by eliminating some or all of the amino acids in 
the sequence not essential for the peptide to function as a glutamate receptor. A 

25 determination of the amino acids that are essential for glutamate receptor function is 
made, for example, by systematic digestion of the DNAs encoding the peptides and/or 
by the introduction of deletions into the DNAs. The modified (e.g., deleted or 
digested) DNAs are expressed, for example, by transcribing the DNA and then 
introducing the resulting mRNA into Xenopus oocytes, where translation of the 

30 mRNAs will occur. Functional analysis of the proteins thus expressed in the oocytes 
is accomplished by exposing the oocytes to ligands known to bind to and functionally 
activate glutamate receptors, and then monitoring the oocytes to see if the expressed 
fragments form ion channel(s). If ion channel(s) are detected, the fragments are 
functional as glutamate receptors. 
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The above-described method can be carried out in the presence of 
NMDARl-like receptor subunits alone, or in the presence of combinations of 
NMDARl-like and NMDAR2-like receptor subunits. Thus, for example, when the 
protein being tested is an NMDAR2-hke receptor subunit, the additional subunit is 
5 preferably an NMDARl-like subunit. 

In accordance with still another embodiment of the present invention, 
there is provided a method for identifying compounds which bind to human 
N-methyl-D-aspartate (NMDA) receptor subunit(s), said method comprising 
employing receptor proteins of the invention in a competitive binding assay. Such an 

10 assay can accommodate the rapid screening of a large number of compounds to 
determine which compounds, if any, are capable of binding to NMDA receptors. 
Subsequently, more detailed assays can be carried out with those compounds found to 
bind, to further determine whether such compounds act as modulators, agonists or 
antagonists of invention receptors. 

15 Another application of the binding assay of the invention is the assay 

of test samples (e.g., biological fluids) for the presence or absence of receptors of the 
present invention. Thus, for example, serum from a patient displaying symptoms 
related to glutamatergic pathway dysfunction can be assayed to determine if the 
observed symptoms are perhaps caused by over- or under-production of such 

20 receptor(s). 

The binding assays contemplated by the present invention can be 
carried out in a variety of ways, as can readily be identified by those of skill in the art. 
For example, competitive binding assays can be employed, such as radioreceptor 
assays, and the like. 

25 In accordance with a further embodiment of the present invention, 

there is provided a bioassay for identifying compounds which modulate the activity of 
human NMDA receptors of the invention, said bioassay comprising: 

(a) exposing cells containing DNA encoding human NMDA 
receptor subunit(s), wherein said cells express functional 

30 NMDA receptors, to at least one compound whose ability to 

modulate the ion channel activity of said receptors is sought to 
be determined; and thereafter 

(b) monitoring said cells for changes in ion channel activity. 
The above-described bioassay enables the identification of agonists 

35 and antagonists for human NMDA receptors. According to this method, recombinant 
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NMDA receptors are contacted with an "unknown" or test substance (in the further 
presence of a known NMDA agonist, when antagonist activity is being tested), the ion 
channel activity of the known glutamate receptor is monitored subsequent to the 
contact with the "unknown" or test substance, and those substances which increase or 
5 decrease the ion channel response of the known glutamate receptor(s) are identified as 
functional ligands (i.e., modulators, agonists or antagonists) for human NMDA 
receptors. 

In accordance with a particular embodiment of the present invention, 
recombinant human NMDA receptor-expressing mammalian cells or oocytes can be 

10 contacted with a test compound, and the modulating effect(s) thereof can then be 
evaluated by comparing the NMDA receptor-mediated response in the presence and 
absence of test compound, or by comparing the response of test cells, or control cells 
(i.e., cells that do not express NMDA receptors), to the presence of the compound. 

As used herein, a compound or signal that "modulates the activity of an 

15 NMDA receptor" refers to a compound or signal that alters the activity of NMDA 
receptors so that activity of the NMDA receptor is different in the presence of the 
compound or signal than in the absence of the compound or signal. In particular, such 
compounds or signals include agonists and antagonists. The term agonist refers to a 
substance or signal, such as NMDA, that activates receptor function; and the term 

20 antagonist refers to a substance that interferes with receptor function. Typically, the 
effect of an antagonist is observed as a blocking of activation by an agonist. 
Antagonists include competitive and non-competitive antagonists. A competitive 
antagonist (or competitive blocker) interacts with or near the site specific for the 
agonist (e.g., ligand or neurotransmitter). A non-competitive antagonist or blocker 

25 inactivates the functioning of the receptor by interacting with a site other than the site 
that interacts with the agonist. 

As understood by those of skill in the art, assay methods for 
identifying compounds that modulate human NMDA receptor activity (e.g., agonists 
and antagonists) generally require comparison to a control. One type of a "control" 

30 cell or "control" culture is a cell or culture that is treated substantially the same as the 
cell or culture exposed to the test compound, except the control culture is not exposed 
to test compound. For example, in methods that use voltage clamp 
electrophysiological procedures, the same cell can be tested in the presence and 
absence of test compound, by merely changing the external solution bathing the cell. 

35 Another type of "control" cell or "control" culture may be a cell or a culture of cells 
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which is identical to the transfected cells, except the cells employed for the control 
culture do not express functional human NMDA receptor subunits. In this situation, 
the response of test cell to test compound is compared to the response (or lack of 
response) of receptor-negative (control) cell to test compound, when cells or cultures 
5 of each type of cell are exposed to substantially the same reaction conditions in the 
presence of compound being assayed. 

In accordance with yet another embodiment of the present invention, 
the ion channel activity of human N-methyl-D-aspartate (NMDA) receptors can be 
modulated by contacting such receptors with an effective amount of at least one 

10 compound identified by the above-described bioassay. 

In accordance with yet another embodiment of the present invention, 
there are provided antibodies generated against the above-described receptor proteins. 
Such antibodies can be employed for studying receptor tissue localization, subunit 
composition, structure of functional domains, as well as in diagnostic applications, 

15 therapeutic applications, and the like. Preferably, for therapeutic applications, the 
antibodies employed will be monoclonal antibodies. 

The above-described antibodies can be prepared employing standard 
techniques, as are well known to those of skill in the art, using the invention receptor 
proteins or portions thereof as antigens for antibody production. Both anti-peptide 

20 and anti-fusion protein antibodies can be used [see, for example, Bahouth et al. (1991) 
Trends Pharmacol Sci . vol. 12:338-343; Current Protocols in Molecular Biology 
(Ausubel et al., eds.) John Wiley and Sons, New York (1989)]. Factors to consider in 
selecting portions of the NMDAR subunits for use as immunogen (as either a 
synthetic peptide or a recombinantly produced bacterial fusion protein) include 

25 antigenicity, accessibility (i.e., extracellular and cytoplasmic domains), uniqueness to 
the particular subunit, etc. 

The availability of subunit-specific antibodies makes possible the 
application of the technique of immunohistochemistry to monitor the distribution and 
expression density of various subunits (e.g., in normal vs. diseased brain tissue). Such 

30 antibodies could also be employed for diagnostic and therapeutic applications. 

In accordance with still another embodiment of the present invention, 
there are provided methods for modulating the ion channel activity of receptor(s) of 
the invention by contacting said receptor(s) with an effective amount of the above- 
described antibodies. 
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The antibodies of the invention can be administered to a subject 
employing standard methods, such as, for example, by intraperitoneal, intramuscular, 
intravenous, or subcutaneous injection, implant or transdermal modes of 
administration, and the like. One of skill in the art can readily determine dose forms, 
5 treatment regiments, etc, depending on the mode of administration employed. 

The invention will now be described in greater detail by reference to 
the following non-limiting examples. 

EXAMPLE 1 

10 Isolation of DNA encoding human NMDA receptor NMDAR1 subunits 

A. cDNA Library Screening 

RNA isolated from human hippocampus tissue was used as a template 
for the synthesis of oligo dT-primed and randomly primed, single-stranded cDNA 

15 according to standard procedures [see, for example, Maniatis et al. (1982) Molecular 
Cloning: A Laboratory Manual , Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY]. The single-stranded cDNA was converted to double-stranded cDNA, 
and EcoRVSnaBVXhol adaptors were added to the ends thereof. The cDNAs were 
separated by size using agarose gel electrophoresis, and those that were >2.0 kb were 

20 ligated into £coRI-digested A.gtl0 bacteriophage vectors. The resulting cDNA library 
was amplified by replication of each clone through limited infection of a bacterial 
host, and stored at -70°C. 

The amplified hippocampus oligo dT-primed cDNA library was later 
retrieved from storage and 1 x 106 recombinants were screened for hybridization to 

25 oligonucleotides corresponding to nucleotides 96-128 (SE7) and nucleotides 2576- 
2609 (SE8) of the rat NMDAR1 A receptor cDNA (see Monyoshi et al. (1991) Nature 
554:31). Hybridization was performed at 42°C in 6X SSPE, 5X Denharts solution, 
10% formamide, 0.2% SDS and 200 }ig/ml herring sperm DNA. Washes were 
performed in IX SSPE and 0.2% SDS at 50°C. Hybridizing clones (e.g. NMDA 1-3) 

30 were identified. These clones hybridized to SE8 but not to SE7. 

A randomly primed primary human hippocampus cDNA library (~2 x 
105 recombinants prepared by selecting only cDNAs >2.0 kb for inclusion in the 
library) was screened under the same conditions for hybridization to oligonucleotide 
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SE8 and an oligonucleotide corresponding to nucleotides 129-141 of the rat 
NMDAR1A receptor cDNA (SE1 1). Five hybridizing clones, which hybridized to 
SE8 and not to SE1 1, were identified: NMDA5-7 and NMDA10-1 1. 

5 B. Characterization of Clones 

The clones were plaque purified and characterized by restriction 
enzyme mapping and DNA sequence analysis of the inserts. One of the clones, 
NMDA1 1 (see Sequence ID No. IB for a description of a portion of NMD A 1 l),is a 

10 full-length cDNA (i.e., it contains translation initiation and termination codons) 
encoding a complete NMDAR1 subunit. The remaining clones are partial cDNAs. 
Clones NMDA2, NMDA3 (see Sequence ID No. ID), NMDA5, NMDA6, NMDA7 
(see Sequence ID No. 1C), and NMDA10 (see Sequence ID No. 1A for a description 
of a portion of NMDA10) contain a translation termination codon but lack nucleotides 

1 5 at the 5' end of the coding sequence. 

Characterization of the clones revealed that the isolated cDNAs 
correspond to different alternatively spliced forms of the human NMDAR1 subunit 
transcript. The four types of alternate splicing represented by the clones are depicted 
schematically in Figure 1. Clone NMDA10 (which lacks 5' untranslated sequences as 

20 well as 60 nucleotides of the 5' end of the coding sequence) is used as a reference to 
which the other variants are compared. Clone NMDA1 1 lacks 363 nucleotides (in the 
3' portion of the clone) that are present in NMDA10. This 363-nucleotide deletion 
does not disrupt the reading frame of the transcript; however, it results in a different 
termination codon. The last 69 nucleotides of the coding sequence of NMDA1 1 

25 correspond to 3' untranslated sequence of clone NMDA10 (i.e., nucleotides 3325- 
3393 of Sequence ID No. 1). Clone NMDA7 lacks the same 363-nucleotide sequence 
that is deleted from NMDA1 1; however, NMDA7 further lacks 204 nucleotides at the 
5' end that are present in NMDA10 and NMDA11. This 204-nucleotide deletion also 
does not disrupt the reading frame of the transcript. Additionally, NMDA7 contains a 

30 63-nucleotide in-frame insertion at the 5' end relative to NMDA10 and NMDA1 1. 
The last 69 base pairs of the coding sequence of NMDA7 correspond to 3 1 
untranslated sequence of NMDA10 i.e., nucleotides 3325-3393 of Sequence ID No. 
1). Clone NMDA3 lacks 1087 base pairs at the 3' end that are present in NMDA10. 
This 1087-base pair deletion does not disrupt the reading frame of the transcript: 

35 however it results in a different termination codon. The last 231 base pairs of the 
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coding sequence of NMDA3 correspond to 3* untranslated sequence of clone 
NMDA10 (i.e., nucleotides 4049-4279 in Sequence ID No. 1 ). 

EXAMPLE 2 

5 Preparation of full-length NMDAR1 subunit cDNA constructs 

Portions of clones NMDA 10, NMD A 1 1 , NMDA7 and NMDA3 were 
ligated together to construct full-length cDNAs encoding variants of the NMDA 
receptor NMDAR1 subunit. The full-length NMDAR1 subunit cDNAs were 
incorporated into vector pcDNAl (Invitrogen, San Diego, CA) for use in expressing 

10 the receptor subunits in mammalian host cells and for use in generating in vitro 
transcripts of the DNAs to be expressed in Xenopus oocytes. 

Vector pcDNAl is a pUC19-based plasmid that contains the following 
elements in the 5 -to-3' order: the cytomegalovirus (CMV) immediate early gene 
promoter/enhancer, the bacteriophage T7 RNA polymerase promoter, a polylinker, 

15 the bacteriophage SP6 RNA polymerase promoter, SV40 RNA processing (i.e., splice 
donor/acceptor) signals, SV40 polyadenylation signal, and the ColEl origin and supF 
suppressor tRNA to permit maintenance of the vector in Escherichia coli strains with 
the P3 episome. This vector thus contains all the regulatory elements required for 
expression of heterologous DNA in a mammalian host cell, wherein the heterologous 

20 DNA has been incorporated into the vector at the polylinker. In addition, because the 
T7 and SP6 promoters are located on either side of the polylinker, this plasmid can be 
used for synthesis of in vitro transcripts of heterologous DNA that has been sublconed 
into the vector at the polylinker. 

25 A. NMDAR1A 

Full-length construct NMDAR1A was prepared by ligation of a 5' 
portion of NMDA1 1 (beginning 5' of the translation initiation codon and extending to 
the HindlW site in the middle of the clone) and a 3 1 portion of NMDA 10 (beginning at 
30 the HindlW site in the middle of the clone and extending 3' of the translation 

termination codon) as depicted in Figure 2. The two DNA fragments were joined in 
mammalian expression vector pcDNAl. 
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Initially, the strategy for generating the NMDAR1 construct involved a 
first step of separately subcloning the entire 4.0 kb EcoRl insert fragment of 
NMDA10 and the entire 4.0 kb SnaBl insert fragment of NMDA1 1 into pcDNAl; 
however, two attempts employing this cloning strategy were unsuccessful. It appeared 
5 that there may have been selection against E. coli hosts retaining the complete insert 
fragments since the surviving recombinant E. coli that were analyzed contained 
incomplete insert cDNAs from which nucleotides had been deleted. Therefore, it was 
necessary to prepare the full-length NMDAR1A construct in several steps by 
subcloning and combining various fragments of NMDA10 and NMDA1 1 in pcDNAl 

10 as follows (see Figure 3 for locations of restriction enzyme sites ). 

Clone NMDA10 was digested with Bglll and EcoRl and the -3.3 kb 
fragment containing nucleotides 1020-4298 of Sequence ID No. 1 was isolated and 
subcloned into BamHVEcoRl-digested pcDNAl. The resulting plasmid was digested 
with Hindlll and Nhel and the fragment containing nucleotides 2137-4298 of 

15 Sequence ED No. 1 plus a portion of pcDNAl was isolated. 

Clone NMDA1 1 was digested with EcoRl and Hindlll and the -2.1 kb 
fragment containing nucleotides 1-2136 of Sequence ID No. 1 was isolated and 
subcloned into £coRI////>2dIII-digested modified pcDNAl (modified by deletion of 
the Hindlll site located 5' of the EcoRl site in the polylinker and addition of a Hifidlll 

20 site into the polylinker at a position 3' of the EcoRl site). The resulting plasmid was 
digested with Nhel and Hindlll and the fragment containing nucleotides 1-2136 of 
Sequence ED No. 1 plus a portion of modified pcDNAl was isolated. This 
NheVHindlll fragment was then ligated to the HindllUNhel fragment containing 
nucleotides 2137-4298 of Sequence ID No. 1 to generate the full-length construct 

25 NMDAR1A (see Figure 2). The ligation mix was used to transform E. coli strain 
MC1061/P3. Because the Nhel site in pcDNAl occurs within the supF selection 
gene, only E. coli containing the correctly ligated, complete NMDAR1 A plasmid 
(which has the complete, functional selection gene) were able to survive the selection 
process. This fragment subcloning strategy enabled selection of the desired correct 

30 NMDAR1 A-containing E. coli host cells, even though the total number of such 
recombinant host cells was small. 

In summary, construct NMDAR1A contains 261 base pairs of 5 1 
untranslated sequence from NMDAR11 (nucleotides 1-261 of Sequence ID No. 1) 
and a complete coding sequence (nucleotides 262-3078 of Sequence ID No.l ) for the 

35 NMDAR1A variant of the NMDAR1 subunit as well as 1220 base pairs of 3' 
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untranslated sequence (nucleotides 3079-4298 of Sequence ID No. 1). The 
NMDAR1 A-encoding sequence is operatively linked to the regulatory elements in 
pcDNAl for expression in mammalian cells. 

5 B. NMDAR1-A363 

Full-length construct NMDAR1-A363 was prepared by ligation of a 5' 
portion of NMD A 1 1 (beginning 5' of the translation initiation codon and extending to 
the Hindlll site in the middle of the clone, i.e., nucleotides 1-2136 in Sequence ID No. 

10 1) and a 3 1 portion of NMDA11 (beginning at the Hindlll site in the middle of the 
clone and extending 3' of the translation termination codon, i.e., nucleotides 2137- 
2961 and 3325-4298 of Sequence ID No. 1). As described above, due to the difficulty 
in directly subcloning the entire 4.0 kb SnaBl NMDA1 1 insert into pcDNAl, it was 
necessary to generate the construct by ligating two fragments of the NMDA1 1 insert 

15 into pcDNAl as follows (see Figure 3 for locations of restriction enzyme sites). 

To obtain the 5'NMDA1 1 fragment, clone NMDA1 1 was digested 
with EcoRl and Hindlll and the -2.2 kb fragment containing nucleotides 1-2136 of 
Sequence ID No. 1 was isolated and subcloned into £coRI/////zdIII-digested modified 
pcDNAl (modified as described above). The resulting plasmid was digested with 

20 Nhel and Hindlll and the fragment containing nucleotides 1-2136 of Sequence ID No. 
1 plus a portion of modified pcDNAl was isolated. 

To obtain the 3'NMDA1 1 fragment, clone NMDA1 1 was digested 
with Bglll and EcoRl and the 3.0 kb fragment containing nucleotides 1020-2961 and 
3325-4298 of Sequence ID No. 1 was isolated and subcloned into BamHUEcoRI- 

25 digested pcDNAl. The resulting plasmid was digested with Hindlll and Nhel and the 
fragment containing nucleotides 2137-2961 and 3325-4298 of Sequence ID No. 1 plus 
a portion of pcDNAl was isolated. This Hindlll/ Nhel fragment was then ligated to 
the Nhel/ Hindlll fragment containing nucleotides 1-2136 of Sequence ED No. 1 to 
generate NMDAR1-A363. 

30 In summary, construct NMDAR1-A363 contains 261 base pairs of 5' 

untranslated sequence (nucleotides 1-261 of Sequence ID No. 1) and a complete 
coding sequence for the NMDAR1-A363 variant NMDAR1 subunit (nucleotides 262- 
2961 and 3325-3393 of Sequence ID No. 1) as well as 905 base pairs of 3' 
untranslated sequence (nucleotides 3394-4298 of Sequence ID No. 1). Thus, 

35 NMDAR1-A363 differs from NMDAR1 in that it lacks 363 nucleotides (nucleotides 

- 30 - 



SD9383 




2962-3324 of Sequence ID No. 1 ) that comprise the last 1 1 7 nucleotides of the coding 
sequence and the first 246 nucleotides of the 3* untranslated sequence of NMDAR 1 . 
The NMDAR1-A363 subunit variant-encoding sequence is operatively linked to the 
regulatory elements in pcDNAl for expression in mammalian cells. 

5 

C. NMDAR1-A1087 

Full-length construct NMDAR1-A1087 was prepared by replacing the 
3' end of the NMDAR1 variant-encoding insert of NMDAR1-A363 with a fragment 

10 from the 3' end of clone NMDA3 (see Figure 2). Plasmid NMDAR1-A363 was 
partially digested with Pstl and completely digested with Xbal. There is a Pstl site 
-112 nucleotides upstream of the location of the 363-nucleotide deletion in 
NMDAR1-A363 and an Xbal site in the polylinker located downstream of the 3' 
untranslated sequence of NMDAR1-A363 (see Figure 3). Thus, PstVXbal digestion 

15 of NMDAR1-A363 results in removal of a fragment containing nucleotides 2850- 
2961 and 3325-4298 of Sequence ED No. 1 from the vector. The larger fragment was 
isolated from the digest. 

The insert of clone NMDA3 was cloned into the EcoRI restriction 
site(s) of pGEM (Promega, Madison, WI); and the resulting plasmid was digested 

20 with Pstl and Xbal. The smaller fragment containing nucleotides 2850-2961 and 

4049-4298 of Sequence ID No. 1 was isolated and ligated to the larger fragment from 
the PstVXbal digest of NMDAR 1-A363. The resulting construct was designated 
NMDAR1-A1087. 

In summary, NMDAR 1 -A 1087 contains 261 base pairs of 5' 

25 untranslated sequence (nucleotides 1-261 in Sequence ID No. 1), the complete coding 
sequence for the NMDAR 1 -A 1087 variant NMDAR 1 subunit (nucleotides 262-2961 
and 4049-4279 of Sequence ID No. 1) and 19 base pairs of 3' untranslated sequence 
(nucleotides 4280-4298 of Sequence ID No. 1). Thus, NMDAR1-A1087 differs from 
NMDAR 1 in that it lacks 1087 nucleotides (nucleotides 2962-4048 of Sequence ID 

30 No. 1) that comprise the last 1 17 nucleotides of the coding sequence and the first 970 
nucleotides of the 3' untranslated sequence of NMDAR1. The NMDAR1-A1087 
subunit variant-encoding sequence is operatively linked to the regulatory elements in 
pcDNAl for expression in mammalian cells. 

35 D. NMDAR1-I63-A204 
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Full-length construct NMDAR 1-I63-A204 was prepared by replacing a 
1399-nucleotide fragment of construct NMDAR 1 A (i.e, nucleotides 738-2136 of 
Sequence ID No. 1) with the Pv//II-f///idIII fragment of NMDA7 (i.e., nucleotides 
5 738-831 of sequence ID No. 1, plus nucleotides 1-63 of Sequence ID No. 3 and 
nucleotides 832-984 and 1189-2136 of Sequence ID No. 1), as depicted in Figure 2. 
Because there are multiple Pvull sites in the NMDAR 1 construct, a several-step 
process was required for construction of NMDAR 1 -163- A204 as follows (see Figure 3 
for the location of restriction enzyme sites). 

10 The ~2.2-kb EcoRl-Hindlll fragment isolated from construct 

NMDAR 1 A and containing nucleotides 1-2136 of Sequence ID No. 1 was ligated 
with modified pcDNAl (modified as described in Example 2 A) that had been 
digested with EcoRl and Hindlll. The resulting plasmid was digested with Avrll and 
self-ligated to remove two Pvidl sites from a portion of the plasmid contributed by 

15 pcDNAl. The plasmid was then partially digested with PvuU and completely 

digested with HindUl. The digest was ligated with a 1258-nucleotide Pvull-Hindlll 
fragment isolated from clone NMDA7. The resulting plasmid, designated NMDAR 1- 
I63-A204-5', was digested with BamHl and Hindlll and the ~2-kb fragment 
containing nucleotides 1-831 of Sequence ID No. 1, plus nucleotides 1-63 of 

20 Sequence ID No. 3 and nucleotides 832-984 and 1 189-2136 of Sequence ID No. 1 
was isolated and ligated to /ta/??HI///z;zdIII-digested NMDAR 1 to generate 
NMDAR1-I63-A204. 

NMDAR 1 -163- A204 contains 261 base pairs of 5' untranslated 
sequence (nucleotides 1-261 in Sequence ID No. 1), the complete coding sequence for 

25 the NMDAR1-I63-A204 variant NMDAR1 subunit (nucleotides 262-83 1 of Sequence 
ID No. 1 plus nucleotides 1-63 of Sequence ID No. 3 and nucleotides 832-984 and 
1 189-3078 of Sequence ID No. 1) and 1220 base pairs of 3' untranslated sequence 
(nucleotides 3079-4298 of Sequence ID No. 1). Thus NMDAR1-I63-A204 differs 
from NMDAR 1 in that it contains 63 nucleotides that are not present in NMDAR 1 

30 (nucleotides 1-63 of Sequence ID No. 3) located between nt 831 and 832 of Sequence 
ID No. 1. Further, NMDAR 1 -I63-A204 lacks 204 nucleotides that are present in 
NMDAR 1 (nucleotides 985-1 188 of Sequence ID No. 1). The NMDAR1-I63-A204 
subunit variant-encoding sequence is operatively linked to the regulatory elements in 
pcDNAl for expression in mammalian cells. 
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E. NMDAR1-I63 

Full-length construct NMDAR 1-163 can be described as NMDAR1 in 
5 which a 173-bp fragment (nucleotides 738-910 of Sequence ID No. 1) is replaced 
with the 236-bp Pvull-Smal fragment of NMDA7 (nucleotides 738-831 of Sequence 
ID No. 1, plus nucleotides 1-63 of Sequence ID No. 3 and nucleotides 832-910 of 
Sequence ID No. 1). Because there are multiple PvuU sites in the NMDAR 1 
construct, a several-step process was required for construction of NMDAR 1-163 as 

10 follows. Plasmid NMDAR 1 -163- A204-5' was partially digested with Smal and 
completely digested with Hindlll. The larger vector fragment was ligated with the 
1226-bp SmaVHindlll fragment isolated from NMDA1 1 (nucleotides 91 1-2136 of 
Sequence ID No. 1 ). The resulting vector was digested with BamHl and Hindlll and 
the ~2.2-kb fragment containing nucleotides 1-831 of Sequence ID No. 1, plus 

15 nucleotides 1-63 of Sequence ID No. 3 and nucleotides 832-2136 of Sequence ID No. 
1 was isolated and ligated to £amHI/////zdIII-digested NMDAR 1 to generate 
NMDAR 1-163. 

NMDAR 1-163 contains 261 base pairs of 5' untranslated sequence 
(nucleotides 1-261 in Sequence ID No. 1), the complete coding sequence for the 

20 NMDAR 1-163 variant NMDAR 1 subunit (nucleotides 262-831 of Sequence ID No. 1, 
plus nucleotides 1-63 of Sequence ID No. 3 and nucleotides 832-3078 of Sequence ID 
No. 1) and 1220 nucleotides of 3' untranslated sequence (nucleotides 3079-4298 of 
Sequence ID No. 1). Thus, NMDAR 1-163 differs from NMDAR 1 in that it contains 
63 nucleotides that are not present in NMDAR 1 (nucleotides 1-63 of Sequence ID No. 

25 3), located between nucleotides 831 and 832 of Sequence ID No. 1. The 

NMDAR 1-163 subunit variant-encoding sequence is operatively linked to the 
regulatory elements in pcDNAl for expression in mammalian cells. 

F. NMDAR1-I63-A204-A363 

30 

Full-length construct NMDAR1-I63-A204-A363 was prepared by 
replacing the 2861 nucleotide fragment from construct NMDAR 1 -I63-A204 (ie. 
nucleotides 1438-4298 Sequence ID. No. 1) with the Kpnl-Xbal (polylinker site) 
fragment of NMDAR 1 -A363 (ie. nucleotides 1438-2961 and 3325-4298 of Sequence 
35 ID No. 1) as depicted in Figure 2. The NMDAR1 -I63-A204 was completely digested 
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with Xbal then partially digested with Kpnl due to the presence of two additional 
Kpnl sites in the vector sequence. The resulting 5' NMDAR1 -I63-A204 fragment, 
which includes the pcDNAI vector sequences, was ligated with the 3' Kpn\-Xba\ 
fragment from NMDAR1--A363 to generate NMDAR1 -I63-A204-A363. 
5 In summary, construct NMDAR1-I63-A204-A363 contains 261 base 

pairs of 5' untranslated sequence (nucleotides 1-261 in Sequence ID No. 1), the 
complete coding sequence for the NMDAR1-I63-A204-A363 variant NMDAR1 A 
subunit (nucleotides 262-831 of Sequence ID No. 1, plus nucleotides 1-63 of 
Sequence ID No. 3, plus nucleotides 832-984, 1 189-2961 and 3325-3393 of Sequence 

10 ID No. 1) as well as 905 base pairs of 3' untranslated sequence (nucleotides 3394- 
4298 of Sequence ID. No. 1). Thus, NMDAR1-I63-A204-A363 differs from 
NMDAR1A in that it contains 63 nucleotides that are not present in NMDAR1A 
(nucleotides 1-63 of Sequence ID No. 3) located between nucleotides 831 and 832 of 
Sequence ID No. 1. Further, NMDAR 1 -I63-A204-A363 lacks 204 nucleotides that 

15 are present in NMDAR1A (nucleotides 985-1188 of Sequence ID No. 1) and 363 
nucleotides that are present in NMDAR1A (nucleotides 2962-3324 of Sequence ID 
No. 1) that comprise the last 1 17 nucleotides of the coding sequence and the first 246 
nucleotides of the 3' untranslated sequence of NMDAR1 A. The NMDAR 1-163- 
A204-A363 subunit variant encoding sequence is operatively linked to the regulatory 

20 elements in pcDNAI for expression in mammalian cells. 

G. NMDAR1-I63-A204-A1087 

Full-length construct NMDAR1-I63-A204-A1087 was prepared by 
25 replacing the 2861 nucleotide fragment from construct NMDAR1-I63-A204 (ie, 
nucleotides 1438-4298 Sequence ID. N. 1) with the KpnI-Xbal (polylinker site) 
fragment of NMDAR1-A1087 (ie, nucleotides 1438-2961 and 4049-4298 of Sequence 
ID No. 1) as depicted in Figure 2. The NMDAR 1 -163- A204 was completely digested 
with Xbal then partially digested with Kpnl due to the presence of two additional 
30 Kpnl sites in the vector sequence. The resulting 5' NMDAR1-I63-A204 fragment, 
which includes the pcDNAI vector sequences, was ligated with the 3'KpnI-XbaI 
fragment from NMDAR1-A1087 to generate NMDAR1-I63-A204-A1087. 
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In summary, construct NMDAR1-I63-A204-A1087 contains 261 base 
pairs of 5' untranslated sequence (nucleotides 1-261 in Sequence ID No. 1), the 
complete coding sequence for the NMDAR1-I63-A204-A363 variant NMDAR1A 
subunit (nucleotides 262-831 of Sequence ID No. 1, plus nucleotides 1-63 of 
5 Sequence ID No. 3, plus nucleotides 832-984, 1189-2961 and 4280-4298 of Sequence 
ID No. 1) as well as 19 base pairs of 3' untranslated sequence (nucleotides 4280-4298 
of Sequence ID. No. 1). Thus, NMDAR1-I63-A204-A1087 differs from NMDAR1A 
in that it contains 63 nucleotides that are not present in NMDAR1A (nucleotides 1-63 
of Sequence ID No. 3) located between nucleotides 831 and 832 of Sequence ID No. 

10 1. Further, NMDAR1-I63-A204-A1087 lacks 204 nucleotides that are present in 
NMDAR1 A (nucleotides 985-1 188 of Sequence ID No. 1) and 1087 nucleotides that 
are present in NMDAR1A (nucleotides 2962-4048 of Sequence ID No. 1) that 
comprise the last 1 17 nucleotides of the coding sequence and the first 970 nucleotides 
of the 3' untranslated sequence of NMDAR1A. The NMDAR1-I63-A204-A1087 

15 subunit variant encoding sequence is operatively linked to the regulatory elements in 
pcDNAI for expression in mammalian cells. 



20 



H. Additional Constructs Containing Full-Length cDNAs Encoding 

Variants of the NMDAR1 Subunit 

Additional full-length cDNAs encoding further possible NMDAR1 
variants can be constructed using methods similar to those described in Examples 2A- 
G above. Specifically, the following constructs can be prepared by ligating portions 
of clones NMDA1 1, NMD A 10, NMDA7 and NMD A3 as depicted in Figure 2: 



25 



NMDAR1-A204 (Sequence ID No. 1 J) 

NMDAR1-A204-A363 (Sequence ID No. IK) 

NMDAR1-I63-A363 (Sequence ID No. 1M) 

NMDAR1-I63-A1087 (Sequence ID No. IN) 

30 NMDAR1-A204-A1087 (Sequence ID No. 1L) 
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The full-length cDNAs can also be incorporated into mammalian expression vectors 
such as pcDNAl, as described in Examples 2A-G. 

Several methods can be employed to determine which NMDAR1 
5 subunit variants are actually expressed in various human tissues. For example, 
oligonucleotides specific for the nucleotide sequences located 5' and 3' of the 
insertions and deletions of the NMDAR1 transcripts described herein can be used to 
prime nucleic acid amplifications of RNA isolated from various tissues and/or cDNA 
libraries prepared from various tissues. The presence or absence of amplification 
10 products and the sizes of the products indicate which variants are expressed in the 
tissues. The products can also be characterized more thoroughly by DNA sequence 
analysis. 

RNase protection assays can also be used to determine which variant 
transcripts are expressed in various tissues. These assays are a sensitive method for 

15 detecting and quantitating an RNA species in a complex mixture of total cellular 
RNA. A portion of the NMDAR1 subunit variant DNA is labeled and hybridized 
with cellular RNA. If complementary mRNA is present in the cellular RNA, a DNA- 
RNA hybrid results. The RNA sample is then treated with RNase, which degrades 
single-stranded RNA. Any RNA-DNA hybrids are protected from RNase degradation 

20 and can be visualed by gel electrophoresis and autoradiography. 

Further information on possible splice variants of the NMDAR1 
primary transcript can be obtained by isolation of genomic clones containing 
NMDAR1 subunit-encoding sequences (for example, by hybridization to the human 
NMDAR1 subunit cDNAs disclosed herein) and subsequent characterization of the 

25 resulting clones. 

EXAMPLE 3 

Isolation of DNA Encoding Human NMDA Receptor NMDAR2C Subunits 

Degenerate oligonucleotides were synthesized based on two conserved 
30 regions of rat NMDAR2A, NMDAR2B and NMDAR2C DNAs that encode the 
putative first and fourth transmembrane domains. In rat NMDAR2A DNA, these 
regions are encoded by nucleotides 1669-1692 (oligo SE74) and 2437-2465 (olig 
SE75), respectively, [see Monyer et al ( 1992) Science 256: 1217-1221]. These 
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oligonucleotides were used to pnme nucleic acid amplification of cDNAs prepared 
from RNA isolated from human hippocampus, cerebellum, and orbitofrontal tissue. 
Two products, a 795-bp and a 640-bp fragment, were detected when the reaction 
mixture was analyzed by gel electrophoresis and ethidium bromide staining. The 
5 795-bp fragment amplified from the cerebellum cDNA was subcloned into PGR 1000 
(Invitrogen, San Diego, CA) and characterized by DNA sequence analysis, which 
revealed that it is -867c similar to the rat NMDAR2A DNA sequence, -78% similar 
to the rat NMDAR2B DNA sequence, and -747c similar to the rat NMDAR2C DNA 
sequence. Thus, this plasmid was named pcrNMDAR2A. 

10 The 795-bp insert from pcrNMDAR2A was used to screen 1 x 106 

recombinants of a human hippocampus cDNA library (prepared by using random 
primers to synthesize cDNAs from hippocampus tissue and selecting fragments >2.0 
kb for insertion into XgtlO vectors) and a human cerebellum cDNA library (random- 
primed library size-selected for fragments >2.8 kb in /^gtlO). Hybridization was 

15 performed in 5X SSPE, 5X Denhart's solution, 50% deionized formamide, 0.2% SDS, 
200 |ng/ml sonicated, denatured herring sperm DNA at 42°C. Washes were performed 
in IX SSPE, 0.2% SDS at 55°C. The probe hybridized to 1 1 plaques from the 
hippocampus library and 8 plaques from the cerebellum library. 

DNA sequence analysis and/or restriction enzyme mapping of 15 of 

20 the hybridizing plaques that were purified surprisingly revealed that they were more 
similar to rat NMDAR2C DNA than to rat NMDAR2A DNA. All of the clones were 
partial cDNAs (i.e., they lacked a translation initiation and/or termination codon) and 
were designated as NMDAR2C cDNAs. Comparison of the clones revealed that the 
human NMDAR2C subunit transcript is differentially processed. 

25 Clones NMDA26, NMDA24, NMDA22 and NMDA21 (see Figure 4) 

represent four basic clones that were identified, all of which are believed to be splice 
variants. Clone NMDA26 (Sequence ID No. 5D) is used as a reference to which the 
other variants can be compared. Clone NMDA24 (Sequence ID No. 5C) contains a 
24-bp sequence (see Sequence ED No. 7) that is not present in NMDA26. Clone 

30 NMDA22 (Sequence ID No. 5B) lacks 15 bp that are present in NMDA26, and clone 
NMDA21 (Sequence ID No. 5 A) lacks 51 bp that are present in NMDA26. Clones 
NMDA22 and NMDA24 both contain an 1 1-bp sequence (Sequence ID No. 9) that is 
not present in NMDA26 (between nucleotides 1 1 16-1 1 17 of Sequence ID No. 5). 
Introduction of this sequence into these clones (between nucleotides 1 1 16-1 1 17 of 
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Sequence ID No. 5) disrupts the reading frame of the transcript and introduces a 
premature translation termination (i.e., STOP) codon into the transcript. 

Clones NMDA26 and NMDA27 (see Figure 4) are partial NMDAR2C 
cDNAs that contain 5' untranslated sequence, a translation initiation codon and some 
5 of the coding sequence. Clone NMDA26 contains 1 88 base pairs of untranslated 
sequence whereas clone NMDA27 contains -1.1 kb of 5' untranslated sequence. The 
sequences of the 5' untranslated regions of these two clones are identical for the first 
15 nucleotides proceeding 5' of the translation initiation codon. However, beginning 
with the 16th nucleotide 5' of the translation initiation codon, the sequences of the two 
10 clones diverge (compare nucleotides 1 16-191 of Sequence ID No. 5 to nucleotides 1 - 
74 of Sequence ID No. 12). 

EXAMPLE 4 

Preparation of Full-length NMDAR2C Subunit cDNA Constructs 

15 Portions of the partial NMDAR2C clones can be ligated in a variety of 

ways to generate constructs encoding full-length NMDAR2C subunit variants. The 5' 
end of each NMDAR2C cDNA can be contributed by NMDA26, whereas the 3' ends 
of the constructs are contributed by various combinations of clones NMDA21, 
NMDA22, and NMDA24. Figure 5 depicts full-length NMDAR2C constructs and 

20 indicates the portions of the different clones that contribute to each construct. 

For example, full-length constructs can be prepared using methods 
such as those described in Example 2 for preparing NMDAR1 constructs. Thus, 
clone inserts are transferred into a vector (e.g., pcDNAl) for ease of manipulation and 
then desired portions of the cDNAs are isolated by restriction enzyme digestion of the 

25 vectors. This can require several steps and/or partial digests if, for example, there are 
no unique restriction enzyme sites surrounding the desired portions of the cDNAs. 
The desired cDNA fragments are then ligated and incorporated into an expression 
plasmid such as pcDNAl or pCMV-T7-2. 

Plasmid pCMV-T7-2 (see Figure 6) is a pUC19-based vector that 

30 contains a cytomegalovirus (CMV) promoter/enhancer, SV40 splice donor/splice 
acceptor sites located immediately downstream of the promoter, a T7 bacteriophage 
RNA polymerase promoter positioned downstream of the SV40 splice sites, an SV40 
polyadenylation signal downstream of the T7 promoter, and a polylinker between the 
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T7 promoter and the pol yadenylation signal. This vector thus contains all the 
regulatory elements required for expression of heterologous DNA in a mammalian 
host cell, wherein the heterologous DNA has been incorporated into the vector at the 
polylinker. In addition, because the T7 promoter is located just upstream of the 
5 polylinker, this plasmid can be used for synthesis of in vitro transcripts of 

heterologous DNA that has been subcloned into the vector at the polylinker. Plasmid 
pCMV-T7-3, also depicted in Figure 6, is identical to pCMV-T7-2 except that the 
order of the restriction enzyme sites in the polylinker is reversed. This plasmid can 
also be used for heterologous expression of NMDAR subunit DNA. 

10 Construct pcDNAl-26-NotI-24-5UT contains 188 base pairs of 5' 

untranslated sequence (nucleotides 1-188 of Sequence ED No. 5), the complete coding 
sequence of the first variant of the human NMDAR2C subunit (nucleotides 189-3899 
of Sequence ID No. 5) and -440 base pairs of 3' untranslated sequence (nucleotides 
3900-4340 of Sequence ID No. 5). The NMDAR2C cDNA is contained within the 

15 polylinker of expression vector pcDNAl for expression. 

Construct pCMV-26-NotI-24 (Sequence ID No. 5) contains 49 base 
pairs of 5' untranslated sequence (nucleotides 140-188 of Sequence ID No. 5), the 
complete coding sequence of a first variant of the human NMDAR2C subunit 
(nucleotides 189-3899 of Sequence ID No. 5) and -440 base pairs of 3' untranslated 

20 sequence (nuceotides 3900-4340 of Sequence ID No. 5). The NMDAR2C cDNA is 
contained within the polylinker of expression vector pCMV-T7-2 for expression. 

Construct pCMV-26-ScaI-24 (Sequence ID No. 5E) is identical to 
pCMV-26-NotI-24, except it contains 24-base pairs (Sequence ID No. 7) inserted 
between nucleotides 2350 and 2351 of Sequence ID No. 5. 

25 Construct pCMV-26-ScaI-22 (Sequence ID No. 5F) is identical to 

pCMV-26-NotI-24, except that it lacks 15-base pairs (nucleotides 1960-1974 of 
Sequence ID No. 5). 

Construct pCMV-26-ScaI-21-NotI-24 (Sequence ID No. 5G) is 
identical to pCMV-26-NotI-24, except that it lacks 51 -base pairs (nucleotides 2351- 

30 2401 of Sequence ID No. 5). 

Construct NMDAR2C-A1 5-124 (Sequence ID No. 5H) is identical to 
pCMV-26-NotI-24, except that it lacks 15-base pairs (i.e., nucleotides 1960-1974 of 
Sequence ID No. 5) and includes a 24-base pair sequence (i.e., Sequence ID No. 7; 
inserted between nucleotides 2350 and 2351 of Sequence ID No. 5). 
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Construct NMDAR2C-A1 5-^51 (Sequence ID No. 51) is identical to 
pCMV-26-NotI-24, except that it lacks 15-base pairs (i.e., nucleotides 1960-1974 of 
Sequence ID No. 5) and 51-base pairs (i.e., nucleotides 2351-2401 of Sequence ID 
No. 5). 

5 Additional full-length NMDAR2C constructs can readily be prepared 

as described herein. For example, 5' untranslated sequence obtained from NMDA27 
(instead of NMDA26) can be employed, and the 3' ends of the constructs can be 
contributed by various combinations of clones NMDA21, NMDA22, and NMDA24. 

Several methods (e.g., nucleic acid amplification, RNase protection 

10 assays, etc.), as described in Example 2, can be employed to determine which 
NMDAR2C subunit variants are actually expressed in various human tissues. 

Human NMDAR2C has 83.5% GC nucleotide content between 
nucleotides 2957 and 3166. To potentially enhance NMDAR2D subunit expression, 
the GC content in this region can be reduced while maintaining the native amino acid 

15 sequence. Synthetic DNAs can be made by oligonucleotide primer extension across 
this region. Four oligonucleotides, SE343 (Sequence ID No. 17), SE344 (Sequence 
ID No. 18), SE345 (Sequence ID No. 19 ), and SE346 (Sequence ID No. 20) were 
synthesized. These primers maintain the amino acid sequence of the human 
NMDAR2D receptor and some restriction sites, but lower the overall GC content of 

20 this region to 53.4%. The criteria for the modification of bases were: 1) to not have 
more than 4 guanine nucleotides in a row if at all possible, 2) to maintain the 
restriction cutting sites for Noil (nucleotides 2962 - 2969 of Sequence ID No. 5), 
Avail (nucleotides 3069 - 3073 Sequence ID No. 5), and Aatll (nucleotides 3156 - 
3161 of Sequence ID No. 5), 3) to reduce the secondary structure of the 

25 oligonucleotides as much as possible, 4) to not introduce any additional Notl, Avail or 
Aatll restriction sites into the sequence and 5) to have the basepair overlap between 
oligonucleotide pairs, {SE343 and SE344} or {SE345 and SE346} have a proposed 
melting temperature between 62-66°C. The oligonucleotide pair SE343 and SE344 
have complementary sequence from nucleotides 51 - 71 of Sequence ID Nos. 17 and 

30 18. The oligonucleotide pair SE345 and SE346 have complementary sequence from 
nucleotides 42 - 61 of Sequence ID No. 19 and nucleotides 43 - 62 of Sequence ID 
No. 20, resepectively. 

The primer pairs, {SE343 and SE344} and {SE345 and SE346}, are 
combined in a standard PCR reaction mixture, which contains 50 pmoles of each 

35 oligonucleotide, and are amplified according to the following PCR protocol: 
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Annealing temperature of 55°C for 1 min, extension temperature of 
72°C for 2 min and melting temperature, 96°C for 30 seconds for 30 
cycles. 

5 

The resulting 121 bp PCR product from the primer pair SE343-SE344 is digested with 
Notl and Aval, and the resulting 103 bp PCR product from the primer pair SE345- 
SE346 is digested with Aval and Aatll. These fragments are ligated into pCMV- 
NMDAR2C-26-NotI-24, which has been partially digested with both Notl and Aatll 
10 due to the presence of additional Notl and/or Aatll restriction sites in the vector 

sequence, to form pCMV-26-NotI-24-GCMOD. This construct, pCMV-26-NotI-24- 
GCMOD, contains nucleotides 140-2965 of Sequence ID No. 5, followed by the 195 
nucleotides set forth in Sequence ID No. 21, and then nucleotides 3161 to 4340 of 
Sequence ID. No. 5. 

15 

EXAMPLE 5 

Isolation of DNA Encoding Human NMDA Receptor NMDAR2A Subunits 

Two human cDNA libraries were prepared using different 
oligonucleotides (random and specific primers) to prime cDNA synthesis from RNA 

20 isolated from cerebellum tissue. The specific primer used for first-strand synthesis 
was SE162, nucleotides 904 to 929 of Sequence ED No. 10. cDNAs synthesized by 
random priming that ranged in size from 1.0-2.8 kb, and cDNAs synthesized by 
specific priming that ranged in size from 0.6-1.1 kb, were isolated and inserted into 
the XgtlO phage vector to generate the two libraries. 

25 The random-primed library (3 x 106 recombinants) was screened for 

hybridization to the 795-base pair insert from pcrNMDAR2A (see Example 3) in 5X 
SSPE, 5X Denhart's solution, 50% deionized formamide, 0.2% SDS, 200 |ag/ml 
sonicated, denatured herring sperm DNA at 42°C. Washes were performed in IX 
SSPE, 0.27r SDS at 55°C. The probe hybridized to 1 1 plaques. 

30 The specifically-primed library (6 x 10^ recombinants) was screened 

for hybridization to oligonucleotide SE177 (nucleotides 859 to 884 of Sequence ID 
No. 10) in 6X SSPE, 5X Denhart's solution, 107c deionized formamide, 0.27c SDS, 
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200 jae'ml sonicated, denatured herring sperm DNA at 42°C. Washes were performed 
in IX SSPE, 0.27r SDS at 50 C C. The probe hybridized to 2 plaques. 

Nine of the hybridizing plaques were purified and the inserts were 
characterized by restriction enzyme mapping and DNA sequence analysis. All clones 
5 contained partial cDNAs. Two of the clones, NMDA53 and NMDA54, contain the 
translation initiation codon and 320 base pairs and 88 base pairs, respectively, of 5' 
untranslated sequence. The sequences of four other clones, NMDA47, NMDA49, 
NMDAR50 and NMDA51, along with those of NMDA53 and NMDA54, overlap to 
comprise -70% of the human NMDAR2A subunit coding sequence (see nucleotides 1 

10 - 3084 of Sequence ID No. 10). 

To obtain clones containing the remaining -1300 base pairs of 3' 
sequence needed to complete the NMDAR2A coding sequence, 6.6 x 106 
recombinants of an additional human cDNA library (an amplified randomly primed 
cerebellum cDNA library with inserts ranging from 1.0 - 2.8 kb in length ) were 

15 screened for hybridization to an oligonucleotide corresponding to the 3' end of clone 
NMDA51 (oligo SE171; nucleotide 3454 to 3479 of Sequence ID No. 10) using the 
same conditions as used for screening the specifically primed cerebellum cDNA 
library as described above. Four hybridizing plaques were purified and the inserts 
were characterized by DNA sequence analysis to determine if they contain the 3' end 

20 of the coding sequence and a translation termination codon. Two of the clones 
(NMDA57 and NMDA58, which were determined to be identical), contain a 
translation termination codon, as determined by DNA sequence analysis. Phage 
lysate containing clone NMDA57 were deposited under the provisions of the 
Budapest Treaty with the American Type Culture Collection (ATCC) on April 13, 

25 1993, and assigned Accession No. 75442. 

EXAMPLE 6 

Preparation of Full-length NMDAR2A Subunit cDNA Constructs 

Two separate constructs encoding a full-length NMDAR2A subunit 
30 (pCMV-hNMDAR2A-l(53) and pCMV-hNMDAR2A-2(54) were prepared by 
ligating portions of the following partial NMDAR2A clones: NMDAR47, 
NMDAR50, NMDAR58 and either NMDAR53 or NMDAR54 (NMDAR53 and 
NMDAR54 differ only in the amount of 5' untranslated sequence contained in the 

- 42 - 



SD93S3 




clones. The inserts of clones NMDA47, NMDA50 and NMDA58 were isolated as 
EcoRl fragments and ligated with EcoRI-digested pCMV-T7-2 to create pNMDA47, 
pNMDA50 and pNMDA58, respectively. The inserts of clones NMDA53 and 
NMDA54 were isolated as Xhol fragments and ligated with 5c//I-digested pCMV-T7- 
5 2 to create pNMDA53 and pNMDA54, respectively. 

pNMDA47 was digested with Seal and Nsil to liberate an ~3,350-bp 
fragment containing a 3' portion of the P-lactamase gene, which encodes a protein 
which imparts ampicillin-resistance, and nucleotides 824-2415 of Sequence ID No. 
10. This fragment was ligated with a ~2890-bp NsiUScal fragment of pNMDA50 

10 (containing a 5' portion of the P-lactamase gene and nucleotides 2416-3346 of 
Sequence ID No. 10) to generate pNMDA47+50. 

The portion of pNMDA58 that encodes the 3' end of NMDAR2A 
contains two Mscl sites. Because the 3* Mscl site is cleaved in preference to the 5' 
Mscl site, partial digestion of pNMDA58 was not an option. Thus, pNMDA58 was 

15 digested with Scal/Mscl, and the ~2020-bp fragment containing a 5' portion of the (3- 
lactamase gene and a 3' portion of the insert (nucleotides 4751-4808 of Sequence ID 
No. 10) was isolated. This fragment was ligated to a ~4150-bp Scal/Mscl fragment of 
pNMDA47+50 (containing a 3' portion of the P-lactamase gene and nucleotides 824- 
3212 of Sequence ID No. 10) to generate pNMDA47+50+3 T END58. This plasmid 

20 contained a complete P-lactamase gene and nucleotides 824-3214 and 4751-4808 of 
Sequence ID No. 10. To add nucleotides 343-4750 of Sequence ID No. 10 to 
pNMDA47+50+3END58, pNMDA58 was digested with Mscl, and the isolated 1537- 
bp fragment consisting of nucleotides 3213-4750 of Sequence ID No. 10 was ligated 
to Mscl-digested pNMDA47+50+3END58. The resulting plasmid, 

25 pNMDA47+50+58, contained nucleotides 824-4808 of Sequence ID No. 10. 

To generate two constructs containing identical NMDAR2A coding 
sequences but differing amounts of 5' untranslated sequence, pNMDA53 and 
pNMDA54 were digested with ScaVEcoRl to liberate fragments containing a 3' 
portion of the P-lactamase gene and nucleotides 1-854 and 225-854 of Sequence ID 

30 No. 10, respectively. pNMDA47+50+58 was digested with ScaUEcoRl (partial) and 
the 3954-bp fragment containing a 5' portion of the P-lactamase gene and nucleotides 
855-4808 of Sequence ID No. 10 was separately ligated with the ScaVEcoRl 
fragments of pNMDA53 and pNMDA54 to generate pCMV-hNMDAR2A-l(53) and 
pCMV-hNMDAR2A-2(54), respectively. These two constructs are identical except 

35 for the amount of 5' untranslated sequence contained in each. Both contain a full- 
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length NMDAR2A-encoding sequence (nucleotides 31 1-4705 of Sequence ID No. 10) 
and 103 nucleotides of 3' untranslated sequence (nucleotides 4706-4808 of Sequence 
ID No. 10). pCMV-hNMDAR2A-l(53) contains 310 nucleotides of 5' untranslated 
sequence (nucleotides 1-310 of Sequence ID No. 10), whereas pCMV-hNMDAR2A- 
5 2(54) contains 87 nt of 5' untranslated sequence (nucleotides 224-3 10 of Sequence ID 
No. 10). The NMDAR2A cDNA is operatively linked to the regulator elements of 
pCMV-T7-2 for expression in mammalian host cells. 

There is no unique restriction site 3' of the NMDAR2A-specific DNA 
in pCMV-hNMDAR2A-l(53) that can be used to linearize the plasmid in order to 

10 prepare in vitro transcripts for injection into Xenopus oocytes. To make a construct 
that has a unique 3' restriction site (pCMV-hNMDAR2A-3(53)), essentially the entire 
NMDAR2A-specific DNA of pCMV-hNMDAR2A-l(53) was transferred into vector 
pCMV-T7-3 as follows. pCMV-NMDAR2A-l(53) was digested with Not\ and the 
~4.4-kb fragment was isolated and ligated with Nctfl-digested pCMV-T7-3 to generate 

15 pCMV-hNMDAR2A-3(53). 

EXAMPLE 7 

Isolation of DNA Encoding Human NMDA Receptor NMDAR2B Subunits 

A human fetal brain XZAP cDNA library (1 x 106 recombinants; 

20 Stratagene, La Jolla, CA) was screened for hybridization to a DNA fragment 
containing the entire rat NMDAR2B subunit coding sequence (see Monyer et al. 
(1992) Science 256:1217-1221). Hybridization was conducted in 50% deionized 
formamide, 5X Denhart's solution, 5X SSPE, 200 jag/ml sonicated, denatured herring 
sperm DNA and 0.2% SDS at 42°C. Washes were performed in 0.5X SSPE, 0.2% 

25 SDS at 65°C. One of the hybridizing clones excised from the human fetal brain 
library, NMDA81, containing a 5,435 bp insert and translation initiation and 
termination codons, encodes a full-length NMDAR2B subunit. This excised plasmid, 
which is in the pBluescript vector, was called pBS-hNMDAR2B. 

NMDA81 was digested with EcoRUEcoRV and the ~5.5-kbp fragment 

30 was isolated and ligated to £c<?RI/£coRV-digested pCMV-T7-3. The resulting 
construct, pCMVPL3-hNMDAR2B, contains the NMDAR2B coding sequence 
(nucleotides 210-4664 of Sequence ID No. 13), as well as 209 nucleotides of 5' 
untranslated sequence (nucleotides 1-209 of Sequence ID No. 13) and 339 nucleotides 
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of 3' untranslated sequence (nucleotides 4665-5003 of Sequence ID No. 13). The 
NMDAR2B-encoding DNA in this construct is operatively linked to regulatory 
elements in pCMV-T7-3 for expression in mammalian host cells. 

5 EXAMPLE 8 

Isolation of DNA Encoding Human NMDA Receptor NMDAR2D subunits 

A human fetal brain cDNA library (1 x 106 recombinants; Stratagene, 
La Jolla, CA) was screened by subtraction screening methods for DNA encoding a 
human NMDAR2D receptor subunit. In this method, plaques were selected on the 
10 basis of weak or no hybridization to DNAs encoding human NMDAR2A, 
NMDAR2B and NMDAR2C subunits. 



Initially, the library was screened for hybridization to pcrNMDAR2A 
(see Example 3) under low- stringency conditions (30% deionized formamide, 5X 

15 Denhart s solution, 5X SSPE, 200 ng/ml sonicated herring sperm DNA, 0.2% SDS at 
42°C). Washes were also performed using low-stringency conditions (2X SSPE, 
0.2% SDS, 50°C). The filters were stripped, then screened for hybridization to the 
pcrNMDAR2A fragment and to an -1200 bp PstI fragment of DNA encoding a 
human NMDAR2B subunit (see Example 7) and an -950 bp AccI fragment of DNA 

20 encoding a human NMDAR2C subunit (see Example 3). These fragments contain 
DNA encoding all of the putative transmembrane domains of the subunits. 
Hybridization was performed under high-stringency conditions (50% deionized 
formamide, 5X Denhart 's solution, 5X SSPE, 200 ng/ml sonicated herring sperm 
DNA, 0.2% SDS at 42°C) as were washes (0.1X SSPE, 0.1% SDS, 65°C). 

25 Eighteen of the plaques that hybridized weakly to pcrNMDAR2A in 

the initial low stringency screening of the library hybridized only weakly or not at all 
to portions of DNA encoding human NMDAR2A, NMDAR2B and NMDAR2C 
subunits in the high stringency screening. The plaques were purified, and the insert 
fragments were characterized by DNA sequence analysis. One of the inserts, 

30 NMDA96, corresponds to the 3' half of the human NMDAR2D subunit gene coding 
sequence. The sequence of this clone is provided in Sequence ID No. 15. 
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To obtain clones containing the remaining -2000 bp of 5* sequence 
needed to complete the NMDAR2D subunit coding sequence, the human fetal brain 
cDNA library was screened for hybridization to an -831 bp Sma\ fragment of the 
clone containing the 3'half of the NMDAR2D coding sequence under high stringency 
5 hybridization and washing with 0.5X SSPE, 0.2% SDS at 65°C. Nine hybridizing 
plaques were purified and analyzed by DNA sequencing, which revealed that none of 
the plaques contain DNA encoding a translation initiation codon and extending 3' to at 
least the 5 1 end of the clone containing the 3' half of the NMDAR2D coding sequence. 

A human cDNA library was prepared using a specific oligonucleotide, 
10 SE296, to prime cDNA synthesis from RNA isolated from human fetal brain. The 
specific primer used for first-strand synthesis was SE296 (nucleotides 2920-2949 of 
Sequence ID No. 15). cDNAs synthesized by specific priming that were greater than 
2.2 kb in size were isolated and inserted into the A.ZAPII phage vector to generate the 
library. 

15 The specifically primed library (1 x 106 recombinants) was screened 

for hybridization to the 831 bp Stnal fragment from NMDAR2D (nucleotides 2435- 
3265 of Sequence ID No. 15) in 5X SSPE, 5X Denhart's solution, 50% deionized 
formamide, 0.2% SDS, 200 jag/ml sonicated, denatured herring sperm DNA at 42°C. 
Washes were performed in 0.1 X SSPE, 0.2% SDS at 65°C. One probe hybridized to 

20 1 1 plaques. 

Eleven of the hybridizing plaques were purified, and the inserts 
characterized by restriction enzyme mapping and DNA sequence analysis. Six of the 
clones (NMDA111, NMDA1 12, NMDA115, NMDA116, NMDA119 and 
NMDA121) contain the translation initiation codon and varying amounts of 5' 

25 untranslated sequence. 

The sequences of these clones overlap with NMDA96 to constitute 
100% of the human NMDAR2D subunit coding sequence (see nucleotides 485-4495 
of Sequence ID No. 15). 

The full-length hNMDAR2D construct was prepared using NMDA1 15 

30 and NMDA96 cDNAs. NMDA1 15 and NMDA96 cDNAs are already in the 

pBlueScript vector, however the NMDA1 15 cDNA is in the sense orientation from 
the T7 promoter, while the NMDA96 cDNA is in the antisense orientation. For ease 
of subcloning the full-length construct, the NMDA96 cDNA was cloned into the 
sense orientation by digesting NMDA96 with EcoRl and screening the resulting 

35 clones for orientation (NMDAR96-T7). Within the complete human NMDAR2D 
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sequence, there is a unique HindlU at nucleotides 2804 that was used to clone 
NMDA1 15 together with NMDA96. However, there is an additional HindlU site in 
the pBS polylinker at the 5' end of the NMDA1 15 cDNA. Therefore NMDA1 15 was 
fully digested with Spel, a 3' polylinker site, and partially digested with HindlU. The 
5 resulting -5.6 kb Spel-Hindlll fragment from pNMDAl 15 (pBS vector plus 

nucleotides 397-2804 of Sequence ID No. 15)) was Hgated with the 1.7 kb HindlU- 
Spel fragment (nucleotides 2805-4651 of Sequence ID No. 15) from NMDA96-T7 to 
form pBS-hNMDAR2D. In vitro transcripts were prepared for co-injection into 
Xenopus oocytes to test for alteration of NMDAR1 A currents. 

10 The complete NMDAR2D insert is then transfered into the pMMTV- 

T7+ mammalian expression vector as a -4.7 kb EcoRV/Spel fragment. The EcoKV 
and Spel restriction sites are in the multiple cloning region of the pBluscript vector. 

In summary, construct NMDAR2D contains 88 base pairs of 5' 
untranslated sequence (nucleotides 397-484 in Sequence ID No. 15), the complete 

15 coding sequence for the NMDAR2D subunit (nucleotides 484-4495 of Sequence ID 
No. 15) as well as 200 base pairs of 3' untranslated sequence (nucleotides 4496-4695 
of Sequence ID No. 15). The NMDAR2D subunit encoding sequence is operatively 
linked to the regulatory elements in pMMTV-T7 for expression in mammalian cells. 

20 EXAMPLE 9 

Expression of Recombinant Human NMDA Receptor Subunits on Oocytes 

Xenopus oocytes were injected with in vitro transcripts prepared from 
constructs containing DNA encoding human NMDA receptor NMDAR1 and 
NMDAR2 subunits. Electrophysiological measurements of the oocyte 
25 transmembrane currents were made using the two-electrode voltage clamp technique 
(see e.g., Stuhmer ( 1992) Meth. Enzymol 207:319-339). 

A. Preparation of In Vitro Transcripts 

30 Recombinant capped transcripts of NMDA receptor subunit cDNAs 

contained in constructs NMDAR1 A. NMDAR1-I63. NMDAR1 -I63-A204, 
NMDAR1-A1087. NMDAR1 -A363. and pCMV-26-NotI-24 were synthesized from 
linearized plasmids using the mCAP RNA Capping Kit (Cat. #200350, Stratagene, 
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Inc., La Jolla. CA). For experiments in which NMDAR2A or NMDAR2B and 
NMDAR1 or NMDAR 1-163 transcripts were co-injected into Xenopus oocytes, the 
transcripts were synthesized from linearized constructs NMDAR1 A, NMDAR1-I63, 
pCMV-hNMDAR2A-3(53), pCMV-26-7VofI-24 and pBS-hNMDAR2B using 
5 mMessage mMachine (Ambion, catalog #1344, Austin, TX). The mass of each 
synthesized transcript was determined by UV absorbance and the integrity of each 
transcript was determined by electrophoresis through an agarose gel. 

B. Electrophysiology 

10 

Xenopus oocytes were injected with 12.5-50 ng of one or more NMD A 
receptor subunit transcripts per oocyte. The preparation and injection of oocytes were 
carried out as described by Dascal [(1987) Crit. Rev. Biochem. 22:317-387]. Two-to- 
six days following mRNA injection, the oocytes were examined using the two- 

15 electrode voltage clamp technique. The cells were bathed in Ringers solution (115 
mM NaCl, 2.5 mM KC1, 1.8 mM CaCl 2 , 10 mM HEPES, pH 7.3), and the membrane 
potential was clamped at -80 to -100 mV. Drugs were applied by pipetting 6.0 jal 
aliquots of drug-containing solution directly into the bath, or by using gravity-feed 
into a Warner Instruments chamber (volume = 1 10 (il) at a flow rate of 8 ml/min. The 

20 data were sampled at 2-5 Hz with a Labmaster data acquisition board in a PC-386 
using AXOTAPE version 1.2 (Axon Instruments, Foster City, CA) software. The 
data were exported to a laser printer or plotted using Sigmaplot version 5.0. 

NMD A agonists, i.e., 10-30 \iM glycine (gly) and 10-100 jiM 
glutamate (glu) or 100-1000 jaM NMDA, were applied to the bath. If a current 

25 response was observed, the agonists were washed from the bath and 0.1-1.0 mM 
MgCb or 1 |iM MK801 (Research Biochemicals, Inc., Natick, MA) (NMDA 

receptor antagonists) were applied before a second agonist application in order to 
determine whether the current was blocked by antagonists. Alternatively, MgCl2 or 

MK-801 were applied during agonist-induced current flow. The results of multiple 
30 recordings are summarized in Table 1. 
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Analysis of the results shown in Table 1 indicates that, in general, the 
NMD A agonist-induced currents were blocked by either MgCl : or MK801. 

Oocytes injected with transcnpts (12.5 to 65 ng) of the NMDAR-1 
subunit-encoding inserts of constructs NMDAR 1 A, NMDAR 1-163 or NMDAR 1- 
5 A363 were further analyzed to evaluate human NMDA receptor sensitivity to 

glutamate and NMDA. The two-electrode voltage clamp methods described above 
were used to measure current in the cells. 

To determine glutamate and NMDA sensitivity of the recombinant 
human NMDA receptors, various concentrations of glutamate (0.1 - 100 jaM) or 
10 NMDA (3-1000 j^M) were applied to the bath (in the presence of 10-30 jiM glycine) 
and the current response was recorded. The bath was flushed between agonist 
applications. Intermediate test applications of 10 |iM glycine plus 10 |iM glutamate 
were included in the experiments to monitor the receptors for run-down (i.e., 
inactivation of receptors that have been repeatedly activated during prolonged 
15 electrophysiological recording). The data were used to generate dose-response curves 
from which EC50 values for the two agonists were calculated. Glycine sensitivity 

was determined in the same manner except that various concentrations (0.1-100 \±M) 
of glycine were co-applied with 100 |iM NMDA. 

The EC50 values determined for glutamate stimulation of NMDA 

20 receptors expressed in oocytes injected with NMDAR1A, NMDAR1-I63 or 

NMDAR1-A363 transcripts were 0.4, 0.6 and 0.5 jiM, respectively. The EC50 values 

determined for NMDA stimulation of NMDA receptors expressed in oocytes injected 
with NMDAR1 A, NMDAR1-I63 or NMDAR1-A363 transcripts were 6.3, 10.9 and 
1 1.9 jj.M, respectively. 

25 There was a marked potentiation of the current magnitude in response 

to glutamate and glycine in oocytes co-injected with in vitro transcripts of pCMV- 
hNMDAR2A-3(53) and NMDAR 1 A or NMDAR 1-163 compared to the currents 
recorded in oocytes injected with transcripts of either NMDAR1 A or NMDAR1-I63 
alone. Similarly, there was a marked potentiation of the current magnitude in 

30 response to glutamate and glycine in oocytes co-injected with in vitro transcripts of 
NMDAR 1 A and pBS-hNMDAR2B compared to the currents recorded in oocytes 
injected with only the NMDAR 1 A transcript. 

To investigate the pharmacological properties of human NMDA 
receptors generated by coexpression of the human NMDAR 1 A, NMDAR2A and 

35 NMDAR2C subunits, oocytes were co-injected with 50 ng each of in vitro transcripts 
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prepared from the NMDAR1 A, pCMV-hNMDAR2A-3, and P CMV-26-NotI-24 
(NMDAR2C) constructs. The sensitivity of the recombinant heteromenc receptors to 
glycine and NMDA was determined as described above. The EC 50 for glycine 
activation of inward currents in these recombinant oocytes was calculated from the 
5 dose-response curve to be 0.87 ± 0.24 jaM (mean ± S.D. of 4 oocytes), which was 
significantly different than the EC50 calculated for glycine sensitivity of oocytes 

injected with 50 ng each of in vitro transcripts of NMDAR1 A and pCMV- 
hNMDAR2A-3 alone (1.9 ± 0.26 jiM, ; p = 0.0002, one-tailed t-test). The sensitivity 
to NMDA also increased when human NMDAR2C was co-expressed with human 
10 NMDAR1 A and NMDAR2A subunits. The EC50 for NMDA was shifted from 30.2 

±9.4 p.M for oocytes co-injected with 50 ng each of in vitro transcripts of 
NMDAR1 A and pCMV-hNMDAR2A-3 to 1 1 .9 ± 5.2 |iM for oocytes co-injected 
with 50 ng each of in vitro transcripts of NMDAR1 A, pCMV-hNMDAR2A-3 and 
pCMV-26-NotI-24 (mean ± S.D. of 4 oocytes). 

15 

EXAMPLE 10 

Recombinant Expression of Human NMDA Receptor Subunits in Mammalian Cells 

Mammalian cells, such as human embryonic kidney (HEK293 ) cells 
can be transiently and/or stably transfected with DNA encoding human NMDA 

20 receptor subunits (e.g., DNA encoding an NMDAR1 subunit or DNA encoding an 

NMDAR1 subunit and DNA encoding an NMDAR2 subunit such as pCMV-26-NotI- 
24, pCMV-hNMDAR2A-3(53) or pCMVPL3-hNMDAR2B). Transfectants are 
analyzed for expression of NMDA receptors using various assays, e.g., northern blot 
hybridization, electrophysiological recording of cell currents, Ca2+-sensitive 

25 fluorescent indicator-based assays and [3H]-MK801 binding assays. 

A. Transient Transfection of HEK Cells 

Two transient transfections were performed. In one transfection, HEK 
30 293 cells were transiently transfected with DNA encoding an NMDAR1 (construct 
NMDAR1 A) subunit. In another transfection, HEK 293 cells were transiently co- 
transfected with DNA encoding NMDAR1 (construct NMDAR1A) and NMDAR2C 
(pCMV-26-NotI-24) subunits. In both transfections, -2x 106 HEK cells were 
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transiently transfected with 19 jag of the indicated plasmid(s) according to standard 
CaP04 transfection procedures [Wigler et al. (1979) Proc. Natl. Acad. Sci. USA 

76:1373-1376]. In addition. 1 of plasmid pCMV(3gal (Clontech Laboratories, Palo 
Alto, CA), which contains the Escherichia coli (3-galactosidase gene fused to the 
5 CMV promoter, were co-transfected as a reporter gene for monitoring the efficiency 
of transfection. The transfectants were analyzed for [3-galactosidase expression by 
direct staining of the product of a reaction involving (3-galactosidase and the X-gal 
substrate [Jones (1986) EMBO 5:3133-3142]. Transfectants can also be analyzed for 
(3-galactosidase expression by measurement of (3-galactosidase activity [Miller (1972) 
10 in Experiments in Molecular Genetics, pp. 352-355, Cold Spring Harbor Press]. 

The efficiency of these transfections of HEK cells was typical of 
standard efficiencies (i.e., -50%). 

B. Stable Transfection of Mammalian Cells 

15 

Mammalian cells, such as HEK 293 cells, can be stably transfected 
using the calcium phosphate transfection procedure [Current Protocols in Molecular 
Biology, Vol. 7, Wiley Inter-Science, Supplement 14, Unit 9.1.1-9.1.9 (1990)]. Ten- 
cm plates, each containing 1-2 x 106 cells, are transfected with 10 ml of 

20 DN A/calcium phosphate precipitate in media containing approximately 19 jag of 
NMDA receptor subunit-encoding DNA and 1 \ig of DNA encoding a selectable 
marker, for example, neomycin-resistance gene (i.e., pSV2neo). After -14 days of 
growth in media containing typically 1 |!g/ml G418, colonies form and are 
individually isolated using cloning cylinders. The isolates are then subjected to 

25 limiting dilution and screened to identify those that express NMDA receptors using, 
for example, methods described below. 

C. Analysis of Transfectants 

30 1. Northern Blot Hybridization Analysis 

Total RNA was isolated from -1 x 107 HEK cells co-transfected with 
NMDAR1 and pCMV-26-NotI-24, and 5-10 ^g of RNA was used for northern 
hybridization analysis. Fragments from human neuronal NMDAR subunit-encoding 
35 plasmids were randomly primed and labeled with 32p_dCTP Klenow incorporation 
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and used as probes. The northern blot hybridization and wash conditions were as 
follows: 

hybridization in 5x SSPE, 5X Denhart s solution, 507c formamide, at 
42°C followed by washing in 0.2x SSPE, 0.1% SDS, at 65°C. 

5 

Results of these studies revealed the transfectants expressed detectable 
levels of NMDAR1 and NMDAR2C mRNA of the appropriate size (based on the size 
of the cDNAs). 

10 2. Fluorescent indicator-based assays 

Activation of ligand-gated NMDA receptors by agonists leads to an 
influx of cations (both monovalent and divalent), including Ca2+, through the 
receptor channel. Calcium entry into the cell through the channel can in turn induce 

15 release of calcium contained in intracellular stores. Monovalent cation entry into the 
cell through the channel can also result in an increase in cytoplasmic calcium levels 
through depolarization of the membrane and subsequent activation of voltage- 
dependent calcium channels. Therefore, methods of detecting transient increases in 
intracellular calcium concentration can be applied to the analysis of functional 

20 NMDA receptor expression. One method for measuring intracellular calcium levels 
relies on calcium-sensitive fluorescent indicators. 

Calcium-sensitive indicators, such as fluo-3 (Catalog No. F-1241, 
Molecular Probes, Inc., Eugene, OR) are available as acetoxymethyl esters which are 
membrane permeable. When the acetoxymethyl ester form of the indicator enters a 

25 cell, the ester group is removed by cytosolic esterases, thereby trapping the free 
indicator in the cytosol. Interaction of the free indicator with calcium results in 
increased fluorescence of the indicator; therefore, an increase in the intracellular Ca2+ 
concentration of cells containing the indicator can be expressed directly as an increase 
in fluorescence. An automated fluorescence detection system for assaying NMDA 

30 receptors has been described in commonly assigned pending US Patent Application 
No. 07/812,254 and corresponding PCT Patent Application No. US92/1 1090, 
incorporated by reference herein in their entirety. 

Mammalian cells that have been transfected with DNA encoding 
NMDAR1 or NMDAR1 and NMDAR2 subunits can be analyzed for expression of 
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functional recombinant NMDA receptors using the automated fluorescent indicator- 
based assay. The assay procedure is as follows. 

Untransfected mammalian host cells (or host cells transiently 
transfected with pCMV-T7-2) and mammalian cells that have been transfected with 
5 NMDAR1 ± NMDAR2 subunit DNA are plated in the wells of a 96-well microtiter 
dish (Nunc Catalog No. 1-6708, available through Alameda Industries, Escondido, 
CA) that has been precoated with poly-L-lysine at a density of 2.5 x 105 cells/well 
and loaded with fluo-3 by incubation for 2 hours at 20°C in a medium containing 20 
I^M fluo-3, 0.2% Pluronic F-127 in HBS (125 mM NaCl, 5 mM KC1, 1 .8 mM CaCb, 

10 0.62 mM MgCb, 20 mM glucose, 20 mM HEPES, pH 7.4). The cells are then 
washed with assay buffer (i.e. HBS). The microtiter dish is then placed into a 
fluorescence plate reader (e.g., Fluoroskan II, Lab Products International, Ltd., 
Raleigh, NC) and the basal fluorescence of each well is measured and recorded before 
addition of 10 (iM glycine and 10 |iM glutamate to the wells. The fluorescence of the 

15 wells is monitored repeatedly (75 readings at 0.63-sec intervals) following addition of 
agonist. 

The fluorescence of the untransfected host cells preferably will not 
change after addition of glycine and glutamate, i.e., the host cells should not express 
endogenous excitatory amino acid receptors. The fluorescence of mammalian cells 

20 transfected with NMDAR1 ± NMDAR2 subunit DNA will increase after addition of 
glycine and glutamate if a sufficient number of functional NMDA receptors are 
expressed at the cell surface, and fluorescence readings are taken rapidly. 

The resting potential of the membrane of some mammalian host cells 
may be relatively positive (e.g., -35 mV). Because activation of some NMDA 

25 receptors may be significantly reduced at relatively positive potentials, it may be 
necessary to lower the resting potential of the membrane of cells transfected with 
human NMDA receptor subunit-encoding DNAs prior to assaying the cells for 
NMDA receptor activity using the fluorescent indicator-based assay. This may be 
accomplished by adding valinomycin (-10 \iM) to the transfected cells prior to adding 

30 NMDA receptor agonists to initiate the assay. 

3. NMDA Receptor Ligand Binding Assays 

Mammalian cells transfected with NMDAR1 ± NMDAR2 subunit 
35 DNAs can be analyzed for [3H]-MK801 binding. An additional ligand-binding assay 
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for NMDA receptors using 3H-CGP39653 is also descnbed below. Rat brain 
membranes are included in the binding assays as a positive control. 



a. Preparation of Membranes 

i. Buffy coat Homogenate from Rat Cerebral 



Cortex 



Buffy coat membranes are prepared from rat brain cortices as 
10 described by Jones et al [(1989)7. Pharmacol Meth. 27:161]. Briefly, cortices from 
ten freshly thawed frozen rat brains are dissected and weighed. The tissue is 
homogenized in 20 volumes of 0.32 M ice-cold sucrose in a glass homogenizing tube 
using a Teflon pestle. The suspension is centrifuged at 1,000 x g for 10 minutes at 
4°C. The supernatant is decanted and centrifuged at 20,000 x g for 20 minutes at 4°C. 
15 The pellet is resuspended in 20 volumes of ice-cold distilled water with a Polytron for 
30 sec at setting 6. The suspension is centrifuged at 8,000 x g for 20 minutes at 4°C. 
The buffy coat pellet is rinsed gently with supernatant and then recentrifuged at 
48,000 x g for 20 minutes at 4°C. The pellet is resuspended in 20 volumes of ice-cold 
distilled water with a Polytron and centrifuged again at 48,000 x g for 20 minutes. 
20 The wash step is repeated once more. The final suspension is divided into aliquots, 
centrifuged. Each pellet can be stored frozen at -20°C for 12 hrs or more before use. 

ii. Membranes from Transfected and Untransfected 

Mammalian Cells 

25 

In order to prepare membranes from transfected and untransfected 
mammalian cells, the cells are scraped from the tissue culture plates, and the plates 
are rinsed with 5 ml of PBS (phosphate-buffered saline: 137 mM NaCl, 2.7 mM KC1, 
10 mM Na2HP04, 1.7 mM KH2PO4). The cells are centrifuged at low speed in a 

30 table-top centrifuge, and the cell pellet is rinsed with PBS. The cell pellet is 

resuspended in 20 ml of 10 mM Hepes buffer, pH 7.4, using a Polytron at setting 3-6 
for 30 seconds. The cell suspension is centrifuged at 48,000 x g for 20 minutes at 
4°C. The supernatant is discarded, and the pellet is kept frozen for 12 hrs or more at - 

20°C. 

35 
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b. [3H]-MK801 Binding to NMDA Receptors 

The binding of [3H]-MK801 to NMDA receptors is carried out as 
described by Wong et al [( 1986) Proc. Natl Acad. ScL USA &?:7104], with a few 
5 minor changes. Thus, on the day of the assay, the rat brain and mammalian cell 

(transfected and untransfected) membrane pellets are resuspended in 50 volumes of 
10 mM Hepes buffer, pH 7.4, using a 10-ml syringe and a 21 -gauge needle, and 
incubated for 20 minutes at 37°C. The supernatant is centrifuged at 48,000 x g for 20 
minutes at 4°C. The pellet is resuspended in 2 ml of 10 mM Hepes, pH 7.4 and 

10 centrifuged as described above. The wash step is repeated once more, and the pellet 
is resuspended in 10 ml of 10 mM Hepes, pH 7.4. The protein concentration is 
determined using the Biorad Bradford reagent. The pellet is finally resuspended in 
the assay buffer (10 mM Hepes, pH 7.4) at 1 mg/ml. 

For binding studies, the membrane suspension is incubated in duplicate 

15 with 2.5 nM [3H]-MK801 (New England Nuclear, Boston, MA) in a total volume of 
0.5 ml assay buffer (10 mM Hepes, pH 7.4) in the presence and absence of 10 jiM 
glutamate and 10 \iM glycine for 60 or 120 min at 23°C. Bound radioactivity is 
separated from free radioactivity by rapid filtration through Whatman GF/C filters 
which are presoaked for 2-3 hrs in 0.05% polyethylenimine. The filters are washed 

20 twice with 3 ml ice-cold assay buffer. The filters are dried and transferred to 

scintillation vials, each containing 10 ml of scintillation fluid. The vials are vortexed, 
and the radioactivity is measured in a Beckman scintillation counter. The nonspecific 
binding observed in the presence of 10 (iM MK801 is subtracted from the total 
binding in order to determine the specific binding. 

25 Rat brain cortical buffy coat membranes displayed specific saturable 

binding of [3HJ-MK801. In the presence of glycine and glutamate, the ratio of total- 
to-nonspecific binding (S:N ratio) was 28:1, whereas in the absence of glutamate and 
glycine the S:N ratio was 5:1 . Thus, the binding of MK801 to rat NMDA receptors is 
potentiated by glutamatergic agonists. Scatchard analysis of [3H]-MK801 binding to 

30 rat brain membranes indicated that the sensitivity of the assay was 90 fmoles of 
receptor. 

c. [ 3 H]-CGP39653 Binding to NMDA Receptors 
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The binding of [3H]-CGP39653 to rat brain membranes is carried out 
as described by Sills et al [(1991) Eur. 7. Pharmacol 792:19]. The buffy coat 
membrane pellet is resuspended in 50 volumes of 5 mM Tns-HCl containing 10 mM 
EDTA, pH 7.7, and incubated for 10 min. at 37°C. The supernatant is centrifuged at 
5 48,000 x g for 10 min. at 4°C. The wash step is repeated once and the pellet is 

resuspended in 10 ml of 5 mM Tns-HCl containing 10 mM EDTA, pH 7.7. This rat 
brain membrane suspension is incubated in duplicate or triplicate with 2.0 nM 
[3H]-CGP39653 (New England Nuclear) in a total volume of 0.5 ml assay buffer (5 
mM Tris-HCl, pH 7.7) for 60 min at 0°C. Nonspecific binding is determined in the 

10 presence of 100 |iM glutamate. Bound radioactivity is separated from the free by 
vacuum filtration through GF/C filters which are presoaked for 2-3 hrs in 0.05% 
polyethylenimine, using the filtration manifold. Unbound radioactivity is removed 
with two washes of 3 ml each of ice-cold buffer. The filters are dried and transferred 
to scintillation vials, each containing 10 ml of scintillation fluid. The vials are 

15 vortexed, and the radioactivity is measured in a Beckman scintillation counter. The 
nonspecific binding observed in the presence of 100 jaM glutamate is subtracted from 
the total binding to determine the specific binding. 

[3H]-CGP39653 binding was first measured as a function of 
membrane concentration. Specific binding increased linearly with increasing 

20 membrane concentration up to 200 jag of protein in the presence of 2 nM [^H]- 
CGP39653. 

Saturation analysis of [3HJ-CGP39653 binding was carried out by 
incubating 150 |ig of rat buffy coat homogenate with increasing concentrations of 
[3H]-CGP39653 for 60 min at 4°C. Scatchard analysis indicated a single class of 
25 binding sites with a B ma x value of 0.69 ± 0.09 pmoles/mg and a Kd value of 12.3 ± 

0.12 nM. 

While the invention has been described in detail with reference to 
certain preferred embodiments thereof, it will be understood that modifications and 
variations are within the spirit and scope of that which is described and claimed. 

30 

Summary of Sequences 

Sequence ID No. 1 is a nucleotide sequence encoding a human 
N-methyl-D-aspartate (NMD A) receptor subunit, NMDAR1 A, and the deduced 
35 amino acid sequence thereof. 
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Sequence ID No. 1 A is a 3083 nucleotide sequence encoded by clone 
NMDA10, comprising nucleotides 320 - 3402 of Sequence ID No. 1. Thus, Sequence 
ID No. 1A differs from Sequence ID No. 1 in that it does not contain the 319 5' 
nucleotides, nor the 896 3' nucleotides thereof. 
5 Sequence ID No. IB is a 3155 nucleotide sequence encoded by clone 

NMDA11, comprising nucleotides 1 - 2961, plus nucleotides 3325 - 3518 of 
Sequence ID No. 1. Thus, Sequence ID No. IB differs from Sequence ID No. 1 by 
the deletion of 363 nucleotides from the 3' portion thereof (i.e., by the deletion of 
nucleotides 2962 - 3324 of Sequence ID No. 1), and further by the lack of the 781 

10 terminal 3' nucleotides of Sequence ID No. 1. 

Sequence ED No. 1C is a 2542 nucleotide sequence encoded by clone 
NMDA7, comprising nucleotides 556 - 831 of Sequence ID No. 1, plus an additional 
63 nucleotides (set forth in Sequence ID No. 3) and nucleotides 832 - 984, 1 189 - 
2961 and 3325 - 3599 of Sequence ID No. 1. Thus, Sequence ID No. 1C differs from 

15 Sequence ID No. 1 in that it does not contain the 555 5 -most nucleotides thereof, it 
does not contain the 204 nucleotides set forth as nucleotides 985 - 1 188 of Sequence 
ED No. 1, it does not contain the 363 3' nucleotides set forth as nucleotides 2962 - 

3324 of Sequence ID No. 1, and it does not contain the 700 3 -most nucleotides of 
Sequence ID No. 1, while it does contain an additional 63 nucleotides (Sequence ID 

20 No. 3) inserted between nucleotides 831 and 832 of Sequence ID No. 1. 

Sequence ED No. ID is a 593 nucleotide sequence encoded by clone 
NMD A3, comprising nucleotides 2617 - 2961, plus nucleotides 4049 - 4298 of 
Sequence ID No. 1. Thus, Sequence ID No. ID differs from Sequence ID No. 1 in 
that it does not contain the 2616 5' nucleotides thereof, and by the deletion of 1087 
25 nucleotides from the 3' portion thereof (i.e., by the deletion of nucleotides 2962 - 4048 
of Sequence ID No. 1). 

Sequence ID No. IE is a nucleotide sequence encoding human NMDA 
receptor subunit NMDAR1-A363, comprising nucleotides 1 - 2961, plus nucleotides 

3325 - 4298 of Sequence ID No. 1. Thus, Sequence ID No. IE differs from Sequence 
30 ID No. 1 in that it does not contain the 363 nucleotides set forth as nucleotides 2962 - 

3324 of Sequence ID No. 1. 

Sequence ID No. IF is a nucleotide sequence encoding human NMDA 
receptor subunit NMDAR1-A1087, comprising nucleotides 1 - 2961, plus nucleotides 
4049 - 4298 of Sequence ID No. 1. Thus, Sequence ID No. IF differs from Sequence 
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ID No. 1 in that it does not contain the 1087 nucleotides set forth as nucleotides 2962 
- 4048 of Sequence ID No. 1. 

Sequence ID No. 1G is a nucleotide sequence encoding human NMDA 
receptor subunit NMDAR1-I63. Sequence ID No. 1G is the same as Sequence ID No. 
5 1, further comprising an additional 63 nucleotides (set forth in Sequence ID No. 3) 
inserted between nucleotides 831 and 832 of Sequence ID No. 1. 

Sequence ID No. 1H is a nucleotide sequence encoding human NMDA 
receptor subunit NMDAR1-I63-A204. Sequence ID No. 1H is the same as Sequence 
ID No. 1G, except Sequence ED No. 1H does not contain the 204 nucleotides set forth 
10 as nucleotides 985 - 1188 of Sequence ID No. 1. 

Sequence ID No. II is a nucleotide sequence encoding human NMDA 
receptor subunit NMDAR1-I63-A204-A363. Sequence ID No. II is the same as 
Sequence ID No. 1H, except Sequence ID No. II does not contain the 363 nucleotides 
set forth as nucleotides 2962 - 3324 of Sequence ID No. 1. 
15 Sequence ID No. 1J is a nucleotide sequence encoding human NMDA 

receptor subunit NMDAR1-A204. Sequence ID No. 1J is the same as Sequence ID 
No. 1, except Sequence ID No. 1J does not contain the 204 nucleotides set forth as 
nucleotides 985 - 1 188 of Sequence ED No. 1. 

Sequence ID No. IK is a nucleotide sequence encoding human NMDA 
20 receptor subunit NMDAR1-A204-A363. Sequence ID No. IK differs from Sequence 
ED No. 1 in that Sequence ID No. IK does not contain the 204 nucleotides set forth as 
nucleotides 985 - 1 188 of Sequence ED No. 1, nor the 363 nucleotides set forth as 
nucleotides 2962 - 3324 of Sequence ID No. 1. 

Sequence ID No. 1L is a nucleotide sequence encoding human NMDA 
25 receptor subunit NMDAR1- A204- A 1087. Sequence ID No. 1L differs from Sequence 
ID No. 1 in that Sequence ID No. 1L does not contain the 204 nucleotides set forth as 
nucleotides 985 - 1 188 of Sequence ID No. 1, nor the 1087 nucleotides set forth as 
nucleotides 2962 - 4048 of Sequence ID No. 1. 

Sequence ID No. 1M is a nucleotide sequence encoding human 
30 NMDA receptor subunit NMDAR1-I63-A363. Sequence ID No. 1M is the same as 
Sequence ID No. 1G except Sequence ID No. 1M does not contain the 363 
nucleotides set forth as nucleotides 2962 - 3324 of Sequence ID No. 1. 

Sequence ID No. IN is a nucleotide sequence encoding human NMDA 
receptor subunit NMDAR1-I63-A1087. Sequence No. IN is the same as Sequence ID 
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No. 1G except Sequence ID No. IN does not contain the 1087 nucleotides set forth as 
nucleotides 2962 - 4048 of Sequence ID No. I. 

Sequence ID No. IP is a nucleotide sequence encoding human NMDA 
receptor subunit NMDAR1-I63-A204-A1087. Sequence ID No. IP is the same as 
5 Sequence ID No. 1H, except Sequence ID No. IP does not contain the 1087 
nucleotides set forth as nucleotides 2962 - 4048 of Sequence ID No. 1. 

Sequence ID No. 2 is the amino acid sequence of the NMDA receptor 
subunit set forth in Sequence ID No. 1. 

Sequence ID No. 2A is the amino acid sequence of a portion of an 
10 NMDA receptor subunit as encoded by the nucleotide sequence of Sequence ID No. 
1A. 

Sequence ID No. 2B is the amino acid sequence of an NMDA receptor 
subunit encoded by the nucleotide sequence of Sequence ID No. IB. 

Sequence ID No. 2C is the amino acid sequence of a portion of an 
15 NMDA receptor subunit as encoded by the nucleotide sequence of Sequence ID No. 
1C. 

Sequence ED No. 2D is the amino acid sequence of a portion of an 
NMDA receptor subunit as encoded by the nucleotide sequence of Sequence ID No. 
ID. 

20 Sequence ID No. 2E is the amino acid sequence of an NMDA receptor 

subunit encoded by the nucleotide sequence of Sequence ID No. IE. 

Sequence ID No. 2F is the amino acid sequence of an NMDA receptor 
subunit encoded by the nucleotide sequence of Sequence ID No. IF. 

Sequence ID No. 2G is the amino acid sequence of an NMDA receptor 
25 subunit encoded by the nucleotide sequence of Sequence ID No. 1G. 

Sequence ID No. 2H is the amino acid sequence of an NMDA receptor 
subunit encoded by the nucleotide sequence of Sequence ID No. 1H. 

Sequence ID No. 21 is the amino acid sequence of an NMDA receptor 
subunit encoded by the nucleotide sequence of Sequence ID No. II. 
30 Sequence ID No. 2J is the amino acid sequence of an NMDA receptor 

subunit encoded by the nucleotide sequence of Sequence ID No. 1J. 

Sequence ID No. 2K is the amino acid sequence of an NMDA receptor 
subunit encoded by the nucleotide sequence of Sequence ID No. IK. 

Sequence ID No. 2L is the amino acid sequence of an NMDA receptor 
35 subunit encoded by the nucleotide sequence of Sequence ID No. 1L. 
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Sequence ID No. 2M is the amino acid sequence of an NMDA receptor 
subunit encoded by the nucleotide sequence of Sequence ID No. 1M. 

Sequence ID No. 2N is the amino acid sequence of an NMDA receptor 
subunit encoded by the nucleotide sequence of Sequence ID No. IN. 
5 Sequence ID No. 2P is the amino acid sequence of an NMDA receptor 

subunit encoded by the nucleotide sequence of Sequence ID No. IP. 

Sequence ID No. 3 is a nucleotide sequence encoding the 63 nucleotide 
insert present in Sequence ID Nos. 1C, 1G, 1H, II, 1M, IN and IP. 

Sequence ID No. 4 is the 21 amino acid sequence encoded by the insert 
10 set forth in Sequence ID No. 3. 

Sequence ID No. 5 is a nucleotide sequence of a clone (pCMV-26- 
Notl-24) encoding a human N-methyl-D-aspartate (NMDA) receptor subunit, 
NMDAR2C, and the deduced amino acid sequence thereof. 

Sequence ID No. 5A is a 2026 nucleotide sequence encoded by clone 
15 NMDA21, comprising nucleotides 931 - 2350, and 2402 - 3307 of Sequence ID No. 
5. Thus, Sequence ID No. 5A differs from Sequence ID No. 5 in that it does not 
contain the 930 5' nucleotides thereof, nor the 51 nucleotides located at position 2351 
- 2401 of Sequence ID No. 5, nor the 1061 3' nucleotides of Sequence ID No. 5. 

Sequence ID No. 5B is a 3698 nucleotide sequence encoded by clone 
20 NMDA22, comprising nucleotides 367 - 1300 of Sequence ID No. 5, plus an 

additional 1 1 nucleotides (set forth as Sequence ID No. 9), and nucleotides 1301 - 
1959 and 1975 - 4068 of Sequence ID No. 5. Thus, Sequence ID No. 5B differs from 
Sequence ID No. 5 by the lack of the 366 5-most nucleotides, by the insertion of 1 1 
nucleotides between nucleotides 1300 and 1301 of Sequence ID No. 5, and further by 
25 the lack of the 15 nucleotides of Sequence ID No. 5 from residue 1960 to residue 
1974. 

Sequence ID No. 5C is a 3243 nucleotide sequence encoded by clone 
NMDA24, comprising nucleotides 861 - 1300 of Sequence ED No. 5, plus an 
additional 1 1 nucleotides (Sequence ID No. 9), nucleotides 1301 - 2350 of Sequence 
30 ID No. 5, an additional 24 nucleotides (set forth as Sequence ID No. 7) and 

nucleotides 2351 - 4068 of Sequence ID No. 5. Thus, Sequence ID No. 5C differs 
from Sequence ID No. 5 in that it does not contain the 860 5-most nucleotides 
thereof, while it does contain an additional 1 1 nucleotides (Sequence ID No. 9) 
inserted between nucleotides 1300 and 1301, plus an additional 24 nucleotides 
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(Sequence ID No. 7) inserted between nucleotides 2350 and 2351 of Sequence ID No. 
5. 

Sequence ID No. 5D is a 3025 nucleotide sequence encoded by clone 
NMDA26, comprising nucleotides 1 - 3025 of Sequence ID No. 5. Thus, Sequence 
5 ID No. 5D differs from Sequence ID No. 5 in that it does not contain the 1043 
3 -terminal nucleotides thereof. 

Sequence ID No. 5E is a nucleotide sequence encoding human NMDA 
receptor subunit pCMV-26-ScaI-24, which differs from Sequence ID No. 5 only in 
the insertion of 24 nucleotides (Sequence ID No. 7) between nucleotides 2350 and 
10 235 1 of Sequence ID No. 5. 

Sequence ID No. 5F is a nucleotide sequence encoding human NMDA 
receptor subunit pCMV-26-ScaI-22, which differs from Sequence ID No. 5 only in 
the deletion of nucleotides 1960 - 1974 of Sequence ID No. 5. 

Sequence ID No. 5G is a nucleotide sequence encoding human NMDA 
15 receptor subunit pCMV-26-5raI-21-^rI-24, which differs from Sequence ED No. 5 only 
in the deletion of nucleotides 2351 - 2401 of Sequence ID No. 5. 

Sequence ID No. 5H is a nucleotide sequence encoding human NMDA 
receptor subunit NMDAR2C-A 15-124. Sequence ID No. 5H is the same as Sequence 
ID No. 5F, except Sequence ID No. 5H further contains the 24 nucleotide insert set 
20 forth in Sequence ID No. 7, positioned between nucleotides 2350 and 2351 of 
Sequence ID No. 5. 

Sequence ID No. 51 is a nucleotide sequence encoding human NMDA 
receptor subunit NMDAR2C-A15-A5L Sequence ID No. 51 is the same as Sequence 
ID No. 5G, except Sequence ID No. 51 does not contain the 15 nucleotides set forth as 
25 nucleotides 1960 - 1974 of Sequence ID No. 5. 

Sequence ID No. 6 is the amino acid sequence of the NMDA receptor 
subunit set forth in Sequence ID No. 5. 

Sequence ID No. 6A is the amino acid sequence of a portion of an 
NMDA receptor subunit as encoded by the nucleotide sequence of Sequence ID No. 
30 5A. 

Sequence ID No. 6B is the amino acid sequence of a portion of an 
NMDA receptor subunit as encoded by the nucleotide sequence of Sequence ID No. 
5B. 
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Sequence ID No. 6C is the amino acid sequence of a portion of an 
NMDA receptor subunit as encoded by the nucleotide sequence of Sequence ID No. 
5C. 

Sequence ID No. 6D is the amino acid sequence of a portion of an 
5 NMDA receptor subunit as encoded by the nucleotide sequence of Sequence ID No. 
5D. 

Sequence ID No. 6E is the amino acid sequence of an NMDA receptor 
subunit encoded by the nucleotide sequence of Sequence ID No. 5E. 

Sequence ID No. 6F is the amino acid sequence of an NMDA receptor 
10 subunit encoded by the nucleotide sequence of Sequence ID No. 5F. 

Sequence ID No. 6G is the amino acid sequence of an NMDA receptor 
subunit encoded by the nucleotide sequence of Sequence ID No. 5G. 

Sequence ID No. 6H is the amino acid sequence of an NMDA receptor 
subunit encoded by the nucleotide sequence of Sequence ID No. 5H. 
15 Sequence ED No. 61 is the amino acid sequence of an NMDA receptor 

subunit encoded by the nucleotide sequence of Sequence ID No. 51. 

Sequence ID No. 7 is a nucleotide sequence encoding the 24 nucleotide 
insert present in Sequence ID Nos. 5C, 5E and 5H. 

Sequence ID No. 8 is the 7 amino acid sequence encoded by 
20 nucleotides 2-22 of the insert set forth in Sequence ID No. 7. Because the insert is 

introduced within a codon, the insert itself only encodes 7 amino acids. The terminal 
residues of the nucleotide insert participate in forming codons with adjacent sequence 
at the site of insertion. 

Sequence ID No. 9 is a nucleotide sequence encoding the 1 1 nucleotide 
25 insert present in Sequence ED Nos. 5B and 5C. 

Sequence ID No. 10 is a nucleotide sequence encoding a human 
N-methyl-D-aspartate (NMDA) receptor subunit, NMDAR2A. 

Sequence ID No. 1 1 is the amino acid sequence of an NMDA receptor 
subunit as encoded by the nucleotide sequence set forth in Sequence ID No. 10. 
30 Sequence ID No. 12 is the nucleotide sequence of 71 nucleotides of 5' 

untranslated sequence of clone NMDA27, plus the initiation codon (nucleotides 72 - 
74) of said clone. 

Sequence ID No. 13 is a nucleotide sequence of a clone encoding a 
human N-methyl-D-aspartate (NMDA) receptor subunit, NMDAR2B. 
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Sequence ID No. 14 is the amino acid sequence of the NMDA receptor 
subunit set forth in Sequence ID No. 13. 

Sequence ID No. 15 is a nucleotide sequence of a clone encoding a 
human N-methyl-D-aspartate (NMDA) receptor subunit, NMDAR2D. 
5 Sequence ID No. 16 is the amino acid sequence of the NMDA receptor 

subunit set forth in Sequence ID No. 15. 

Sequence ID Nos. 17-20 are four synthetic oligonucleotides used in the 
preparation of an NMDAR2C clone (pCMV-26-NotI-24-GCMOD) having reduced 
GC nucleotide content between nucleotides 2957 and 3166. 
10 Sequence ID No. 21 is the nucleotide sequence of the 195 basepair 

insert of NMDAR2C clone pCMV-26-NotI-24-GCMOD (replacing nucleotides 2966- 
3 160 of Sequence ID No. 5). 
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( 1 J GENE RA L I NF 0 RMAT ION : 

5 

(i) APPLICANT: Daggett, Lome : 
Ellis, Steven 3. 
Li aw, Chen W. 
Lu, Chin- Chun 

10 



(n) TITLE OF INVENTION. HUMAN N- METHYL - D- ASPARTATE RECEPTOR 
S T J3L*NITS , DNA ENCODING SAME AND USES THEREFOR 

(ill) NUMBER OF SEQUENCES : 2 1 

15 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE : Pretty, Schroeder, Brueggemann & Clark 
(3) STREET: 44 4 South Flower Street, Suite 2000 

(C) CITY: Los Angeles 
20 (D) STATE: CA 

( E ) COUNTRY : U.S.A. 

(F) ZIP: 90C71-2911 

(v) COMPUTER READABLE FOFM : 
25 (A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC- DOS/MS DOS 

( D ) SO FT WAR E : Patent In Release #1.0, V e rsion #1.25 

30 (VI) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER : 
(3) FILING DATE: : 0 - APR 1954 
(C) CLASSIFICATION: 

35 :v:;i PRIOR APPLICATION LATA : 
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I A' APPLICATION' NUMBER: VS 15 ; : 1 , 4 4 r 

filing date: :: apr 1993 
(vi 11) attorney agent information: 

5 (A) NAME Reiner, Stephen E . 

( B ) REG I ST RAT I ON NUMBER : 31,19 2 

(G) REFERENCE/DOCKET NUMBER: P41 94 2 4 

(ix) TE LECOMMUN I GAT I ON I NFORMAT I ON : 
10 (A) TELEPHONE: 619-546 47 3 7 

(3) TELEFAX: 619 =46-93 92 



(2) INFORMATION FOR SEQ ID NO : 1 : 



(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 9 3 base pairs 
( 3 ) TY ? E : nucleic aci d 
(G) STRAND EDNESS : both 
20 (D) TOPOLOGY: both 

(ii ) MOLECULE TYPE: cDNA 

(ix) FEATURE: 
25 (A) NAME /KEY: CDS 

(3) LOCATION : 262 . . 3 D7 8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

30 CAAGCCGGGC GTTCGGAGCT GTGCCCGGGC CCGCTTCAGC ACCGCGGACA GGGCCGGCCG 6 

CGTGGGGCTG AGCGCCGAGC CCCCGCGCAC GCTTCAGCCC CCCTTCCCTC GGCCGACGTC 12 



CCGGGACCGC CGCTCCGGGG GAGACGTGGC GTCCGCAGCC CGCGGGGCCG GGCGAGCGCA 



35 
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10 



; j G A CGGCCCG GAAGCCCCGC GGGGGA.GCG C C G AGGG CC C CGCGH CGCG C C G C G C AG AG 

CCAGGCCCGC GGCCCGAGCC C ATG AGC ACC ATG CGC CTG CTG ACG CTC GCC 

Me: 3er T'r.r Me: Arc leu leu Tr.r leu Ala 



CTG CTG TTC TCC TGC TCC GTC GCC CGC GC Z GCG TGC GAC CCC AAG ATC 33 9 

leu leu Phe Ser Cys Ser Val Ala Arg Ala Ala Cys Asp Pro lys He 

15 2j 25 

GTC AAC ATT GGC GCG GTG CTG AGC ACG CGG AAG CAC GAG CAG ATG TTC 3S7 

Val Asr. He Gly Ala Val leu Ser Thr Arg L/s His Glu Gin Met Phe 

3 j 3 5 4 3 



15 CGC GAG GCC GTG AAC CAG GCC AAC AAG CGG CAC GGC TCC TGG AAG ATT 43 5 

Arg Glu Ala Val Asn Gin Ala Asn lys Arg His Gly Ser Trp Lys He 

45 5D 55 

CAG CTC AAT GCC ACC TCC GTC ACG CAC AAG CCC AAC GCC ATC CAG ATG 4 83 

20 Gin leu Asn Ala Tnr Ser Val Thr His Lys Pro Asn Ala He Gin Me: 

6D 65 73 

GCT CTG TCG GTG TGC GAG GAC CTC ATC TCC AGC CAG GTC TAG GCC ATC 531 

Ala leu Ser Val Cys Glu Asp leu He Ser Ser Gin Val Tyr Ala He 

25 75 8 J 35 9 J 

CTA GTT AGC CAT CCA CCT ACC CCC AAC GAC CAC TTC ACT CCC ACC CCT 57 9 

leu Val Ser His Pro Pro Thr Pro Asn Asp His Phe Thr Pro Thr Pro 

95 ID j 10 5 



30 



jTC TCC TAC ACA GCC GG 2 TTC TAG CGC ATA CCC GTG CTG GGG CTG 
,'al Ser Tyr Tnr Ala Gl / Phe Tyr Arg He PrD Val leu Gly leu 



35 ACC CGI A.G i CC ATC T A7 7CG GAC AAG AGC AH CAC 
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Arg Me: Ser 



lie T/r Ser Asp Lys Ser He His Leu Ser 

- -J - - 1 Z. 



r-r.e _eu 



cg: act 

Arg Thr 

14 : 



ccg c:: 



tc: 



ca: 



Val ?r^ ?n Tyr Ser His 



ag: 

Ser 



Tro ?he 



10 



AT' j CGT GTC TAG AGC 



AAC 



:a: at: ah 



Me: Arg Val Tyr Ser Trp Asn His He 



155 



16 3 



[ * e Le i 
165 



AG 



jAC ga: 



Val Ser Asp Asp 

173 



15 



:ac gag ggc cgg g:g gct cag aaa cgg ctg gag ac 



. s G 1 J Gly Arg Ala Ala Gin Lys Arg Leu 



175 



IE 3 



Hr Leu Le: 



185 



CGT GAG TCC AAG GGA GAG AAG GTG CTG CAG TTT G AC CCA GGG ACC AAG 
Arg Glu Ser Lys Ala Glu Lys Val Leu Gin ?he Asp Pro Gly Thr Lys 



19 3 



195 



233 



819 



20 



AAC 



ACG GCC CTG CTG ATG GAG GCG AAA GAG 



Asn Val Thr Ala Leu Leu Met Glu Ala 



205 



21 j 



GAG GCC 3GG GT I 
Glu Ala Arg Val 
215 



915 



25 



ATC ATC CTT TCT GCC AGO GAG GAC GAT GCT GCC ACT GTA TAG CGC GCA 
lie He Leu Ser Ala Ser Glu Asp Asp Ala Ala Thr Val Tyr Arg Ala 



22 3 



225 



23 3 



963 



30 



ICC GCG ATG CTG AAC ATG ACG GGC TCC GGG 



TGG CTG GTC GGC 



Ala Ala Met Leu Asn Met Thr Gly Ser Gly Tyr Val Trp Leu Val Gly 



235 



24 0 



24 5 



GAG CGC GAG ATC TCG GGG AAC GCC CTG CGC TAC GCC CCA GAC GGC ATC 
Glu Arg Glu He Ser Gly Asn Ala Leu Arg Tyr Ala Pro Asp Gly He 



35 
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J 3 AC GC3 GTG GGC GT 3 GTG 3C 3 CAG GCC GTG CAC 3AG CTC CTC GA3 AAG 1 1 5 r 

Asp Ala Val Gly Val Val Ala Gin Ala Val His Glu leu leu 31 - lys 

285 291 295 

3 AG AA1 ATC ACC GAG CCG GCG GGG GGC TGC 3TG GGC AAC ACC AA2 ATC 12 2 3 

10 3lu Asp. lie Thr Asp PrD Pro Arg Gly lys Val Gly Asr. Thr Asp. He 

3 0 2 3 3 5 3 13 

TGG AAG ACC GGG CCG CTC TTC AAG AGA GTG CTG ATG TCT TCC .AAG TAT 12 51 

Trp Lys Thr Gly Pro Leu Phe Lys Arg Val Leu Met Ser Ser Lys Tyr 

15 315 323 325 333 

GCG GAT GGG GTG ACT GGT GGC GTG GAG TTC AAT GAG GAT GGG 3 AC CGG 12 99 

Ala Asp Gly Val Thr Gly Arg Val Glu Phe Asr. Glu Asp Gly Asp Arg 
3 3 5 3 4 3 3 4 5 

20 

AAG TTC GCC AAC TAG AGO ATC ATG AAC CTG CAG AAC CGC AAG CTG GTG 13 47 

Lys Phe Ala Asn Tyr Ser lie Met Asr. Leu Gin Asr. Arg Lys Leu Val 
350 355 350 

25 CAA GTG GGC ATC TAG AAT GGC ACC CAC GTG ATC CCT AAT GAC AGG AAG 13 95 

GLn Val Gly He Tyr Asn Gly Thr His Val lie Pro Asn Asp Arg Lys 

3 5 5 3 7 3 3 7 5 

ATC ATC TGG CCA GGC GGA GAG ACA GAG AAG CCT CGA GGG TAC CAG ATG 144 3 

30 lie He Trp Pro Gly Gly Glu Thr Glu Lys Pro Arg Gly Tyr Gin Met 

380 385 390 

TCC ACC AGA CTG AAG ATT GTG ACG ATC CAC CAG GAG CCC TTC GTG TAC 1491 

Ser Thr Arg Leu Lys lie Val Thr lie His Gin Glu Pro Phe Val Tyr 

35 395 422 43r 4i: 
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Pro 7hr Leu Ser Asp Gl/ -hr C/s Lys G _ i G~.: 



aa: gg r gat. cca gtc aag aag gtg atc :g: acc ggs gcc aac gag acg 15S? 

Asn Giy Asp Pro Val Lys Lys Val He Cys Thr Gly Pro Asr. Asp Thr 
4 3 0 4 3 5 44 : 

10 TC03 :CG GGG AGG CCC CG : GAG ACG GTG CCT GAG TGT TGG TAG GGC TTT 163 5 

Ser Pro Gly Ser Pro Arg His Thr Val Pr3 Gin Cys Cys Tyr Gly Phe 

445 45D 455 

TGG ATC GAG CTG GTC ATC AAG CTG GCA GGG ACG ATG AAC TTC ACG TAG 1683 

15 Cys lie Asp Leu Leu He Lys Leu Ala Arg Thr Met Asn Phe Thr Tyr 

4 5 J 4 6 5 4 7 j 

GAG GTG GAG CTG GTG GCA GAT GGC AAG TTC GGC ACA GAG GAG GGG GTG 1731 

Gli Val His Leu Val Ala Asp Gly Lys Phe Gly Thr Gin Glu Arg Val 
20 475 430 485 490 

AAC AAC AGG AAC AAG .AAG GAG TGG AAT GGG ATG ATG GGC GAG CTG CTG 177 9 

Asn Asn Ser Asn Lys Lys Glu Trp Asn Gly Met Met Gly Glu Leu Leu 

495 5jj 505 

25 

AGG GGG GAG GCA GAC ATG ATC GTG GCG CCG CTA ACG ATA AAC AAC GAG 132 7 

Ser Gly Gin Ala Asp Met He Val Ala Pro Leu Thr lie Asn Asn Glu 
510 515 523 

30 CGC GCG GAG TAG ATC GAG TTT TCC AAG CCC TTC AAG TAG GAG GGC CTG 137 5 

Arg Ala Gin Tyr He Glu Phe Ser Lys Pro Phe Lys Tyr Gin Gly Leu 



ACT ATT CTG GTC AAG AAG GAG ATT CCC CGG AGC ACG CTG GAC TCG TTC 
35 Thr lie Leu Val Lys Lys Glu He Pro Arc Ser Thr Leu Asp Ser Phe 
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10 



CC3 tt2 cag agc aca ctg tgg ctg ctg gtg ggg ctg tcg 

?:d Phe Glr. Ser Tr.r Leu Trp Leu Leu 7a 1 Gly Le-i 3er 



CAG GTG 2T3 GGG GTG AT 3 CTG TAG GTG CTG 3 AC CGC TTC AGC CC : TTC 
His 7a 1 7a 1 Ala 7a 1 Met Leu Tyr Le- Leu Asp Ar g Phe Ser Pro Phe 



575 



GGG CGG TTC AAG GTG AAG AGC GAG GAG GAG GAG GAG GAG GCA CTG ACC 7:0 67 

Gly Arg Phe Lys Val Asn Ser Glu Glu Glu Glu Glu Asp Ala Lea Thr 

5 9 j 5 9 5 6 J J 

15 CTG TCC TCG GCC ATG TGG TTC TCC TGG GGC GTC CTG CTC AAC TCC GGG L115 

Leu Ser Ser Ala Met Trp Phe Ser Trp Gly 7a 1 Leu Leu Asn Ser Gly 

6 0 5 613 515 

ATC GGG GAA GGC GCC CCC AGA AGC TTC TCA GCG CGC ATC CTG GGC ATG LI 63 

20 lie Gly Glu Gly Ala Pro Arg Ser Phe Ser Ala Arg lie Leu Gly Met 

522 625 632 

GTG TGG GCC GGC TTT GCC ATG ATC ATC GTG GCC TCC TAG ACC GCC AAC L211 

Val Trp Ala Gly Phe Ala Met He lie 7a 1 Ala Ser Tyr Thr Ala Asn 

25 635 64D 545 65j 

CTG GCG GC 2 TTC CTG GTG CTG GAC CGG CCG GAG GAG CGC ATC ACG GGC 22 59 

Leu Ala Ala Phe Leu 7a 1 Leu Asp Arg Pro Glu Glu Arg lie Thr Gly 

655 662 665 

30 

ATC AAC GAC CCC CGG CTG AGG AAC CCC TCG GAC .AAG TTT AT 2 TAG GCC 2 3 27 

lie Asn Asp ?r d Arg Leu Arg Asn Pro Ser Asp Lys Pne He Tyr Ala 

671 675 682 

35 ACG GTG AAG CAG AGC TCC GTG GAT ATC TAC TTC CGG CGC CAG GTG GAG 22 5 5 
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In Ser Ser Val Asp lie . yr Phe 



CTG AG: ACC ATG TAC CG3 CAT ATG GAG AAG CA : AAL TAG GAG AGT GCG 24 13 

5 Leu Ser Thr Me: Tyr Arg His Met Glu Lys His Asn Tyr Glu Ser Ala 

7 :■ j 7 C 5 7 1 : 

3CG GAG 3CC ATC ZAG GLC GTG AGA GAC AAC AA3 CT 3 CAT GCC TTC ATC 24 51 

Ala Gli Ala lie Gin Ala Val Arg Asp Asn Lys Lei His Ala Phe He 

10 715 723 725 731 

TGG GAC TCG GCG GTG CTG GAG TTC GAG GCC TCG CAG AAG TGC GAC CTG 24 9 9 

Trp Asp Ser Ala Val Leu Glu Phe Glu Ala Ser Gin Lys Cys Asp Lei 

735 74C 745 

15 

GTG ACG ACT GGA GAG CTG TTT TTC CGC TCG GGC TTC GGC ATA GGC ATG 2 54 7 

Val Thr Thr Gly Glu Leu Phe Phe Arg Ser Gly Phe Gly He Gly Met 

75D 755 763 

20 CGC AAA GAC AGC GCC TGG AAG CAG AAC GTC TCC CTG TCC ATC CTC AAG 2 59 5 

Arg Lys Asp Ser Pro Trp Lys Gin Asn Val Ser Leu Ser He Leu Lys 

765 77 0 775 

TCC CAC GAG AAT GGC TTC ATG GAA GAC CTG GAC AAG ACG TGG GTT CGG 2 64 3 

25 Ser Hls Glu Asn Gly Phe Met Glu Asp Leu Asp Lys Thr Trp Val Arg 

7 3 3 7 8 5 7 9 3 

TAT CAG GAA TGT GAC TCG CGC AGC AAC GCC CCT GCG ACC CTT ACT TTT 2 6 91 

Tyr Gin Glu Cys Asp Ser Arg Ser Asn Ala Pro Ala Thr Leu Thr Phe 

30 795 30 J 3C5 811 

GAG AAC ATG GCC GGG GTC TTC ATG CTG GTA GCT GGG GGC ATC GTG GCC 2^3? 

Glu Asn Met Ala Gly Val Phe Met Leu Val Ala Gly Gly He Val Ala 



35 
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GGG ATC TTL CTG ATT TTC ATC GAG ATT GCC TAG AAG GGG GAG AAG GAT 2~H 

Sly He Phe leu He Phe He Glu He Ala Tyr Lys Arg His Lys Asp 
831 =35 = 41 

5 GCT GGG CG 3 AAG GAG ATG GAG GTG G_C TTT GGG GGG GTT .AAG GTG TGG 2 S3 5 

Ala Arg Arg Lys Gin Me: Gin Leu Ala Phe Ala Ala 7a 1 Asn Val Trp 

345 35j 355 

GGG AAG AA: GTG CA3 3 AT AGA AAG A 3"T GGT AGA 3CA GAG GCT GAG GGT 2 88 3 

10 Arg Lys Asn Leu Gin Asp Arg Lys Ser Gly Arg Ala Glu Pro Asp Pro 

851 365 37 D 

AAA AAG AAA GGG ACA TTT AGG GCT ATG AGG TGG AGG GTG GGT TCC AGG 2931 

Lys Lys Lys Ala Thr Phe Arg Ala He Thr Ser Thr Leu Ala Ser Ser 

15 375 880 835 3 9 3 

TTC AAG AGG GGT AGG TCC TCC AAA GAC ACG AGG ACC GGG GGT GGA CGC 2 97 9 

Phe Lys Arg Arg Arg Ser Ser Lys Asp Thr Ser Thr Gly Gly Gly Arg 

395 9H 9j5 

20 

GGT GGT TTG GAA AAG GAA AAA GAC ACA GTG CTG 3LCG GGA CGC GCT ATT 3C27 

Gly Ala Leu Gin Asn Gin Lys Asp Thr Val Leu Pro Arg Arg Ala He 
31 D 915 92D 

25 GAG AGG GAG GAG GGC GAG CTG CAG CTG TGT TCC CGT CAT AGG GAG AGC 3 07 5 

Glu Arg Glu Glu Gly Gin Leu Gin Leu Cys Ser Arg His Arg Glu Ser 

925 93D 935 

TGAGACTCCC CGCCHCCCT CCTCTGCCCG TrCCCCCGCA GACAGACAGA C AG AC GG ACG 313 5 



30 



GGACAGCGGC CCGG 2CCACG CAGAGCCCCG GAGCACCACG "JGGTCGGGGG AGGAGCACCC 



HAGCCTCCC CCAGGCTGCG CCTGCCCGCC CGCCGG7TGG HGGCTGGCC GGTCCACCCC 



35 GTC7GGGGCC CGCGZGZGCC CCCAGCGTGG GGGTAACGGG 2GCCTTGTCT GTGTATTTCT 
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10 



;fcrmation for seq zd nq:2: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 93 S arr.i no acids 
(3) TYPE: amino acid 
(D) TOPOLOGY : linear 

(n) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 



15 



Met Ser Thr Met Arc Leu Leu Thr Leu Ala Leu Leu Phe Ser Cys Ser 

1 E 10 15 

Val Ala Arg Ala Ala Cys Asp Pro Lys lie Val Asn lie Gly Ala Val 

20 25 30 



Leu Ser Thr Arg Lys His Glu Gin Met Phe Arg Glu Ala Val Asn Gin 

20 35 40 45 

Ala Asn Lys Arg His Gly Ser Trp Lys lie Gin Leu Asn Ala Thr Ser 

50 55 60 

25 Val Thr His Lys Pro Asn Ala lie Gin Met Ala Leu Ser Val Cys Glu 

6 5 7 0 7 5 8 0 

Asp Leu lie Ser Ser Gin Val Tyr Ala lie Leu Val Ser His Pro Pro 

8E 90 95 



30 



Thr Pro Asn Asp His Phe Thr Pro Thr Pro Val Ser Tyr Thr Ala Gly 



Phe Tyr Arg He Pro Val Leu Gly Lej 

35 ::s 12: 



Arg Met Ser He .yr 



- 78 - 



SD9383 



Ser Asp lys 3er He His leu Ser ?he leu Arc 



Pro Pre Tyr 



5 Ser His Gin Ser Ser Val Trp ?he Giu Met Met Arc Val Tyr Ser Trp 



Asn His He lie leu leu Val Ser Asp Asp His Giu Gly Arg Ala Ala 

165 170 175 

10 

Gin lys Arg leu Giu Thr leu leu Giu Giu Arg Giu Ser lys Ala Giu 
180 185 190 

lys Val Leu Gin Phe Asp Pro Gly Thr lys Asn Val Thr Ala leu leu 
15 195 200 205 

Met Giu Ala lys Giu Leu Giu Ala Arg Val lie lie Leu Ser Ala Ser 

210 215 220 

20 Giu Asp Asp Ala Ala Thr Val Tyr Arg Ala Ala Ala Met Leu Asn Met 

225 230 235 240 

Thr Gly Ser Gly Tyr Val Trp Leu Val Gly Giu Arg Giu lie Ser Gly 
245 250 255 

25 

Asn Ala leu Arg Tyr Ala Pro Asp Gly lie Leu Gly leu Gin leu lie 

260 265 270 

Asn Gly lys Asn Giu Ser Ala His lie Ser Asp Ala Val Gly Val Val 
30 275 281 235 

Ala Gin Ala Val His Giu leu leu Giu lys Giu Asn lie Thr Asp Pro 

o ~i " " 



35 



Pre Arg Gly Cys Val Gly Asn Ihr Asn ^le . rp lys _hr Gly Pro leu 
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Phe Lys Arc Val leu Me: Ser Ser lys Tyr Ala Asp Gly Val Thr Gly 
325 33 3 3 3 5 

5 

Arg Val Glu Phe Asn Glu Asp. Gly Asp Arg Lys Phe Ala Asn Tyr Ser 
3 4 0 3 4 5 3 5 0 

lie Met Asr. Leu Gin Asn Arc Lys Leu Val Gin Val Gly He Tyr Asn 
10 355 360 365 

Gly Thr His Val lie Pro Asn Asp Arg Lys He He Trp Pro Gly Gly 
370 375 380 

15 Glu Thr Glu Lys Pro Arg Gly Tyr Gin Met Ser Thr Arg Leu Lys He 

385 390 395 400 

Val Thr He Kis Gin Glu Pro Phe Val Tyr Val Lys Pro Thr Leu Ser 
405 410 415 

20 

Asp Gly Thr Cys Lys Glu Glu Phe Thr Val Asn Gly Asp Pro Val Lys 
420 425 430 

Lys Val He Cys Thr Gly Pro Asn Asp Thr Ser Pro Gly Ser Pro Arg 
25 435 440 445 

His Thr Val Pro Gin Cys Cys Tyr Gly Phe Cys He Asp Leu Leu He 
450 455 460 

30 Lys Leu Ala Arg Thr Met Asn Phe Thr Tyr Glu Val His Leu Val Ala 
465 470 475 480 

Asp Gly Lys Phe Gly Thr Gin Glu Arc Val Asn Asn Ser Asn Lys Lys 
4 8 5 4 1 1 4 9 5 

35 
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Glu Trp A sr. 

lie Val Ala 

5 515 

Phe Ser Lys 
530 

10 Glu lie Pro 

545 

Thr Leu Trp 

15 

Leu Tyr Leu 

Ser Glu Glu 

20 595 

Phe Ser Trp 
610 

25 Arg Ser Phe 

625 

Met lie lie 

30 

Leu Asp Arg 

Arc A sr. Pro 

35 



Gly Met Me: Gly 

Pro Leu Thr lie 

Pro Phe Lys Tyr 
535 

Arg Ser Thr Leu 

550 

Leu Leu Val Gly 
565 

Leu Asp Arg Phe 

58 0 

Glu Glu Glu Asp 

Gly Val Leu Leu 
615 

Ser Ala Arg lie 

630 

Val Ala Ser Tyr 
645 

Pro Glu Glu Arg 

6 6 C 

Ser Asp Lys Phe 



Glu Leu Leu Ser Gly 

A sr. Asr. Glu Arg Ala 

5 2 0 

Gin Gly Leu Thr lie 
54 0 

Asp Ser Phe Mel Gin 

555 

Leu Ser Val His Val 

570 

Ser Pro Phe Gly Arg 

585 

Ala Leu Thr Leu Ser 

600 

Asn Ser Gly lie Gly 

620 

Leu Gly Met Val Trp 
635 

Thr Ala Asn Leu Ala 

650 

He Thr Gly He Asn 

a a c. 

He Tyr Ala Thr Val 



Gin Ala Asp Met 

Gin Tyr He Glu 

52 5 

Leu Val Lys Lys 

Pro Phe Gin Ser 

560 

Val Ala Val Met 
575 

Phe Lys Val Asn 

590 

Ser Ala Met Trp 

605 

Glu Gly Ala Pro 

Ala Gly Phe Ala 
640 

Ala Phe Leu Val 
655 

Asp Pro Arg Leu 

67: 

Lys Gin Ser Ser 
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Jal Asp He Tyr Phe Arg Arg Glr. Val Glu Leu Ser Thr Met Tyr Arg 



5 His Met Glu Lys His A sr. Tyr Glu Ser Ala Ala Glu Ala He Glr. Ala 

7C5 710 715 721 

Val Arg Asp Asn Lys Leu His Ala ?he lie Trp Asp Ser Ala Val Leu 

725 73 C 735 

10 

Glu Phe Glu Ala Ser Gin Lys Cys Asp Leu Val Thr Thr Gly Glu Leu 
740 745 750 

Phe Phe Arg Ser Gly Phe Gly He Gly Met: Arg Lys Asp Ser Pro Trp 
15 755 760 765 

Lys Gin Asn Val Ser Leu Ser He Leu Lys Ser His Glu Asn Gly Phe 

770 775 73 0 

20 Met Glu Asp Leu Asp Lys Thr Trp Val Arg Tyr Gin Glu Cys Asp Ser 

785 790 795 800 

Arg Ser Asn Ala Pro Ala Thr Leu Thr Phe Glu Asn Met Ala Gly Val 

805 810 315 

25 

Phe Met Leu Val Ala Gly Gly He Val Ala Gly He Phe Leu He Phe 

820 825 830 

He Glu He Ala Tyr Lys Arg His Lys Asp Ala Arg Arg Lys Gin Met 
30 83 5 84 0 84 5 

Gin Leu Ala Phe Ala Ala Val Asn Val Trp Arg Lys Asn Leu Gin Asp 

850 855 en 



35 Arg Lys Ser Gly Arg Ala Glu Pro Asp Pro Lys Lys Lys Ala 7hr Phe 
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A r c A a ~ - e 



Thr Ser Thr Leu Ala Se: Ser Phe Lys Arg Arc Arg Ser 



Ser Lys Asp Thr Ser Thr Gly Gly Gly Arg Gly Ala Leu Glr. Asr. Gin 

9GC 915 ?10 

Lys Asp Thr Val Leu Pro Arg Arg Ala lie Glu Arg Glu Glu Gly Gin 
10 915 920 925 

Leu Gin Leu Cys Ser Arg His Arg Glu Ser 
930 935 



15 



25 



(2) INFORMATION FOR SEQ ID NO : 3 : 



U) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 3 base parrs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 



(li) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME / KEY : CDS 
(3) LOCATION: 1 . . 63 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

AGT AAA AAA AGG AAC TAT GAA *AC CTC GAC CAA CTG TCC TAT GAC AAC 
Ser Lys Lys Arg Asn Tyr Glu ".sr. Leu Asp Gin Leu Ser Tyr Asp Asr. 

35 



- 83 - 



SD9383 



5 



25 



AAG CGC GGA CCC AAG 
Lys Arc Gly Pro Lys 



(2) INFORMATION* FOR SEQ ID NO : 4 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 arr.mo acids 
10 ( B ) TYPE: air.ino acid 

(D) TOPOLOGY: linear 

{li) MOLECULE TYPE: protein 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Ser Lys Lys Arg Asn Tyr Glu Asn Leu Asp Gin Leu Ser Tyr Asp Asn 

1 5 10 15 

20 Lys Arg Gly Pro Lys 

2 0 



(2) INFORMATION FOR SEQ ID NO : 5 : 



(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 4 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : both 
30 <D) TOPOLOGY: both 

(li) MOLECULE TYPE: cDNA 

i IX) FEATURE. 
35 f A ) NAM E r. E Y : CDS 
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(XI) SEQVEXCE DESCRIPTION": SEQ 1 13 \'C : 5 : 



5 cccttaataa gatttg:ca: gtacactcga gccatcgcga GTGTCCTTGA GGCGCGGGTG 6C 

ACGGTGGCTC TCGCTGTTCG CGCCCCCTCC TCCCGCGGGG GGAGCCTGAT GCCACGTTCC 120 

CTATGAATTA 7 ITATCG 0C 3 3CCTAAAAAT ACCCCGAACT TCACAGCCCG AGTGACCCTC ISC 

10 

GGGTGGAC ATG GGT 3GG GCC CTG GGG COG GCC CTG TTG CTC AGO TGG CTC 23 0 

Met Gly Gly A La Leu GLy Pro Ala Leu Leu Leu Thr Ser Leu 
1 5 10 

15 TTG GGT GGC TGG GCA GGG CTG GGT CCG GGG GAG GGC GAG GAG GGC ATG 2 78 

Phe Gly Ala Trp Ala Gly Leu GLy Ptd Gly Gin GLy Glu Gin Gly Met 

15 2 J 2 5 3 D 

ACG GTG GCC GTG GTG TTT AGC AGC TGA 3GG GGG GGG GAG GGG GAG TTG 326 
20 Thr Val Ala VaL Val Phe Ser Ser Ser Gly Pro Pro Gin Ala Gin Phe 

35 43 45 

GGT GTG CGG GTG AGG GCG GAG AGC TTG GTG GAG GTA CCG CTG GAG ATG 3 74 

Arg Val Arg Leu Thr Pro Gin Ser Phe Leu Asp Leu Pro Leu Glu lie 
25 53 55 6j 

GAG CCG CTC AC A GTT GGG GTC AAC ACC AGC AAC CCC AGC AGC CTC CTC 422 

Gin Pro Leu Thr Val Gly Val Asn Thr Thr Asn Pro Ser Ser Leu Leu 

65 12 75 

30 

ACC GAG ATC TGC GGC CTC CTG GGT GCT GCC CAC GTC CAC GGC ATT GTC 470 

Thr Gin lie Cys Gly Leu Leu Gly Ala Ala His Val His Gly He Val 



35 TTT G 



AG GAC AAC GTG GAC ACC GAG GCG GTG GCC CAG ATC CT . 
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\so .-.sr. .a* .~.sn 



Ala Val Ala Gin lie leu Asp Phe 



AT 2 TCC TCC CA: 
lie Ser 3er Glr 



VJC CAT GTG T: ATC CTC AGC 
:'r.r His Val Pro lie leu Ser 



GGC TCT 
le Ser Gly Gly Ser 



10 



GCT GTG 



ACC CCC AAG GAG ICG G 



Ala Val Val lej Thr Pro Lys Glu Pro Gly Ser Ala Phe leu 
13 1 13 5 L4D 



JAG GIG 



Glr leu 



614 



15 



GGC GTG TCC CTG GAG CAG CAG CTG CAC 



TTC AAG GTG CTG GAA 



Gly Val Ser Leu GLu Gin Gin lea Gin Val leu Phe lys Val leu Glu 



145 



15 D 



155 



GAG TAG GAC TGG AGC GCC TTC GCC GTC ATC ACC AGC CTG CAC CCG GGC 



GIj Tyr Asp Trp Ser Ala Phe Ala Val 



16 1 



16: 



:ie Thr Ser leu His Pro Gly 
17 1 



662 



710 



20 CAC GCG CTC TTC CTG GAG GGC GTG CGC GCC GTC GCC GAC GCC AGC CAC 
His Ala Leu Phe Leu Glu Gly Val Arg Ala Val Ala Asp Ala Ser His 



175 



130 



135 



1 90 



758 



GTG AGT TGG CGG CTG CTG GAC GTG GTC ACG CTG GAA CTG GAC CCG GGA 
25 Val Ser Trp Arg Leu leu Asp Val Val Thr leu Glu leu Asp Pr:> Gly 



195 



205 



8 06 



30 



GGG CCG CGC GCG CGC ACG CAG CGC CTG CTG CGC CAG CTC GAC GCG CCC 
Gly Pro Arg Ala Arg Thr Gin Arg leu leu Arg Gin leu Asp Ala Pro 

211 215 220 



854 



Val Phe 



GTG GCC TAG TGC TCG CGC GAG GAG GCC GAG GTG CTC TTC GCC 



.a Tyr Cys Ser 



51 u Glu Ala Glu '/a 1 leu Pre Ala 



35 
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\la Ala 

2 4 : 



ly Leu Val 
24 5 



Leu Val 



CCC AAC CTG 3CG CTG 
Pro Asn Leu A 1 a Leu 

255 



G'jC AGC ACu GAT GCG CCC CCC GCC ACC 
Gly Ser Tnr Asp Ala Pro Pro Ala Thr 



Phe Pro 

27 : 



99 3 



5TC GTC ACG GAG AG( 



GTG GGG CTG AT 2 AGC 
10 Val Gly Leu lie Ser Val Val Thr 

275 



2GG CGC CTC AGC CTG CGC 
Ser Trp Arg Leu Ser Leu Arg 

2 8 2 235 



1146 



15 



2AG AAG GTG CGC GAC GGC GTG GCC ATT CTG GCC CTG GGC GCC CAC AGC 
31n Lys Val Arg Asp Gly Val Ala lie Leu Ala Leu Gly Ala His Ser 



2 9 2 



23z 



333 



1094 



20 



TAC TGG CGC GAG CAT GGA ACC CTG CCA GCC CCG G-2 



GAC TGG CGT 



Tyr Trp Arg Gin His Gly Thr Leu Pro Ala Pro Ala Gly Asp Cys Ar: 



3 35 



312 



315 



GTT CAC COT GGG CCC GTC AGC COT GCC CGG GAG GCC TTC TAC AGG CAC 
Val His Pro Gly Pro Val Ser Pro Ala Arg Glu Ala Phe Tyr Arg His 



32 3 



325 



333 



1142 



1190 



25 CTA CTG AAT GTC ACC TGG GAG GGC GGA GAC TTC TCC TTC AGC CGT GGT 
Leu Leu Asn Val Thr Trp Glu Gly Arg Asp Phe Ser Phe Ser Pro Gly 



335 



3 4 J 



345 



35 3 



1238 



30 



GGG TAC CTG GTC CAG 



ACC ATG GTG GTG ATC GCC CTC AAC CGG CAC 



Gly Tyr Leu Val Gin Pro Thr Me: Val Val lie Ala Leu Asn Arg His 



3 6C 



355 



1286 



35 



CGC CTC 
Arg Leu 



1 G G GAG ATG GTG 
:rp Glu Me: Val 

i 7 i 



GGG CGC TGG GAG CAT GGC GTC CTA - AC A _ G 
Gly Arg Trp Glu His Gly Val Leu Tyr Met 
77^ ^ c - 



13 34 
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Arc Tyr Ser Ala Ser Leu Gin Pro Val Val 



3 9C 



355 



GAC ACT CGG CAC CTG ACG GTG GCC ACG CTG GAA GAG CGG CCC TTT GTC 
Asp Ser Arg His Leu Thr Val Ala Thr Leu Glu Glu Arg Pro Phe Val 



143: 



10 ATG GTG GAG AGG GCT GAG GGT GGG ACA GGA GGG TGT GTG GGG AAG ACG 
lie Val Glu Ser Pro Asp Pro Gly Thr Gly Gly Cys Val Pro Asn Thr 



415 



42 j 



425 



43 j 



1478 



GT>: 



i GG GGG AGG GAG AGG AAG GAG ACG TTG AGG AGG GGG GAG GTG 



15 Val Pro Gys Ar 3 Arg 
4 35 



;ln Ser Asn His Thr Phe Ser Ser Gly Asp Val 



44D 



445 



1526 



20 



GCG GGG TAG AC - AAG GTG TGG TGT AAG GGA 



rGC ATG GAG ATG GTG 



Ala Pro Tyr Thr Lys 
450 



Leu Gys Gys Lys Gly Phe Cys lie Asp He Leu 



455 



4 50 



1574 



25 



AAG AAG GTG GCG AG A GTG GTC AAA 



TCG TAG GAG 



TAG GTG GTG 



Lys Lys Leu Ala Arg Val Val Lys Phe Ser Tyr Asp Lei Tyr Leu Val 



4 5 5 



470 



475 



ACG AAG GGG AAG GAT GGG AAG GGG GTG GGC GGG GTA TGG AAG GGG ATG 
Thr Asn Gly Lys His Gly Lys Arg Val Arg Gly Val Trp Asn Gly Met 



4 6 3 



4 35 



49 j 



1622 



1670 



30 ATT GGG GAG GTG TAC TAG AAG CGG GCA GAC ATG GGG AT 0 GGG TCG GTG 
He Gly Glu Val Tyr Tyr Lys Arg Ala Asp Met Ala He Gly Ser Leu 

495 55: 515 511 



1718 



35 



ACG AT: AAT GAG GAA GGG * GG GAG 



. hr lie A. s n G 1 u G 1 u r g Ser 



ATG G . A GA 0 TTG 7C . GTA GCG TTi 
He Val Asp Phe Ser Val Pro Phe 
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10 



GTG GAG ACG GGC A.C AG. G.G A-G G.G GC . CGC AGC AA. GGC ACC G-C - c - v« 

Val Glu Thr Gly He Ser Val Me: Va 1 Ala Arc Ser Asr. Gly Thr Val 

5 3: 535 54: 

TCC CCC TGG GCC TTC TTG GAG CCA TAT AGC CCT GCA GTG TGG GTG ATG 186 2 

Ser Pro Ser Ala Phe Leu Glu Pro Tyr Ser Pro Ala Val Trp Val Me: 

54 5 5 5 J 555 

ATG TTT GTC ATG TGC CTC ACT GTG GTG GCC ATC ACC GTC TTC ATG TTC 1910 

Me: Phe Val Me: Cys Leu Thr Val Val Ala lie Thr Val Phe Met Phe 

560 565 573 



15 GAG TAC TTC AGC CCT GTC AGC TAC AAC CAG AAC CTC ACC AGA GGC AAG 1958 

Glu Tyr Phe Ser Pro Val Ser Tyr Asr Gin Asn Leu Thr Arg Gly Lys 

57 5 5 30 5 35 59 3 

AAG TCC GGG GGC CCA GCT TTC ACT ATC GGC AAG TCC GTG TGG CTG CTG 2 0 06 

20 Lys Ser Gly Gly Pro Ala Phe Thr He Gly Lys Ser Val Trp Leu Lea 

5 9 5 6 :0 6 C 5 

TGG GCG CTG GTC TTC AAC AAC TCA GTG CCC ATC GAG AAC CCG CGG GGC 2 0 54 

Trp Ala Leu Val Phe Asn Asn Ser Val Pro He Glu Asn Pr d Arg Gly 
25 513 615 623 

ACC ACC AGC AAG ATC ATG GTT CTG GTC TGG GCC TTC TTT GCT GTC ATC 2102 

Thr Thr Ser Lys He Met Val Leu Val Trp Ala Phe Phe Ala Val He 

625 63D 635 

30 

TTC CTC GCC AGA TAC ACG GCC AAC CTG GCC GOC TTC ATG ATC :AA GAG 215 1 

Phe Leu Ala Arg Tyr Thr Ala Asn Leu Ala Ala Phe Mel He Gin Glu 

5 4: 645 65: 

35 CAA TAC ATC GAC ACT GTG TCG GGC CTC AGT GA 2 AAG AAG TTT CAG CGG 219 5 
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:yr He Asp Thr Val Ser Gly leu Ser Asp Lys Lys Phe Glr. Arc 



COT CAA GAT GAG TAG GGA CCT TTG GGG TTG GGG AGG GTG CCG AAG GGG 2 24 6 

5 Pro Glr. Asp Glr. Tyr Pro Pro Phe Arc Phe Gly Thr Val Pro Asr. Gly 

6 7 5 5 S 1 6 8 5 

AGG ACG GAG GGG AAG ATC CGC AGT .AAG TAG GGT GAG ATG GAG AGG GAG 22 94 

Ser Thr Glu Arc Asr He Arg Ser Asr Tyr Arg Asp Met His Thr His 

10 69j 695 710 

ATG GTG .AAG TTG AAG GAG CGC TGG GTG GAG GAG GGG CTC ACG AGG GTG 2 3 4 2 

Me: Val Lys Phe Asr Glr Arg Ser Val Glu Asp Ala Leu Thr Ser Leu 

705 71 D 715 

15 

AAG ATG GGG AAG GTG GAT GCC TTG ATG TAT GAT GGT GCT GTG CTC AAC 2 3 90 

Lys Met Gly Lys Leu Asp Ala Phe lie Tyr Asp Ala Ala Val Leu Asr 

7 2 j 725 73j 

20 TAG ATG GCA GGG AAG GAG GAG GGG TGG AAG CTG GTG ACC ATT GGG TGT 2 43 8 

Tyr Met Ala Gly Lys Asp Glu Gly Cys Lys Leu Val Thr He Gly Ser 
735 740 745 753 

GGG AAG GTC TTT GCT ACG ACT GGG TAG GGG ATC GCG ATG GAG AAG GAG 2 4 86 

25 Gly Lys Val Phe Ala Thr Thr Gly Tyr Gly He Ala Met Glr Lys Asp 

755 76 1 765 

TGG GAC TGG AAG GGG GCG ATA GAG CTG GCG CTC TTG CAG TTG CTG GGG 2534 

Ser His Trp Lys Arg Ala He Asp Leu Ala Leu Lei Glr Phe Leu Gl / 

30 7 70 77 5 733 

GAG GGA GAG AGA GAG AAA CTG GAG ACA GTG TGG GTG TCA GGG ATG TGG 2 5 5 2 

Asp Gly Glu Thr Glr Lys Leu Glu Thr /al Trp Leu Ser Gly He Cys 



7 9 0 



35 
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.u Val Me: Ser Ser Lys Leu Asp He Asp A sr. 



5 ATG GCA GGC GTC TTC TAG ATG CTG GTG GTG GCG ATG GGG CTG GCG CTG 
Met Ala Gly Va 1 Phe Tyr Me: leu Leu Va 1 Ala Me: Gly Leu Ala Lej 



TTC GCG 



j AG CAG 



?: TAG TGG AA' 



10 Leu Val ?he Ala Trp Glu His Leu Va 1 Tyr Trp Lys Leu Arg His Ser 



835 



34 j 



34 5 



15 



GTG GCG AAC TCA TCC CAG CTG GAG TTC CTG CTG 



TTC AGC AGG GGC 



Val Pro Asn Ser Ser Gin Leu Asp Phe Leu Leu Ala Phe Ser Arg Gly 



BSD 



355 



361 



:77<; 



20 



ATG TAG AGC TGG TTC AG 



CAG AGC CTC GCC AGC CCA GCG CGG 



lie Tyr Ser Cys Phe Ser Gly Val Gin Ser Leu Ala Ser Pro Pro Arg 



355 



37 j 



37: 



CAG GCC AGC CCG GAC CTC ACG GCC AGC TGG GCC CAG 



AGC GTG CTC 



Gin Ala Ser PrD Asp Leu Thr Ala Ser Ser Ala Gin Ala Ser Val Leu 



■3-30 



835 



393 



2 82 2 



J870 



25 



AAG ATG CTG CAG GCA 



CGC GAC ATG GTG ACC AG 



TA AGC 



Lys Met Leu Gin Ala Ala Arg Asp Me: Val Thr Thr Ala Gly Val Ser 



395 



3 J3 



905 



91 j 



•918 



AGC 



3 AC CGC GCC ACT CGC ACC ATC GAG AA'I 



30 Ser Ser Leu Asp Arg Ala Thr Arg Thr lie Glu Asn Trp Gly 



GGC 
i" l • - 

32 5 



2966 



35 



CGG 



GCG GCC CCA CCG TCC CCG TGG GCG ACC CC 



* la Pro Pro Pro Ser Pro Cys Pro 



-r Pro Arg Ser Gly Pro 



i : : 4 



-91 - 



SD93S? 




AGC CCA TGC CTG CCC ACC CCC GAC CCG CCC CCA GAG CCG AGC CCC ACG 3 16 2 

Ser Pro Cys Leu Pro Thr ?r d Asp Pro Pro Pro Glu Pro Ser Pro Thr 
945 93: 95^ 

5 

GGC TGG GGA CCG CCA GAC GGG GGT CGC GCG GCG CTT GTG CGC AGG GCT 3111 

Gly Trp GLy Pro PrD Asp Gly Gly Arg Ala Ala Leu Val Arg Arg Ala 

?5j 965 979 

10 CCG CAG CCC CCG GGC CGC CCC JCG ACG CCG GGG CCG CCC CTG TCC GAC 3158 

Pro Gin Pro Pro GLy Arg Pro Pro Thr Pro GLy Pro Pro Leu Ser Asp 

975 939 935 999 

GTC TCC CGA GTG TCG CGC CGC CCA GCC TGG GAG GCG CGG TGG CCG GTG 3 2 06 

15 Val Ser Ar? Val Ser Arg Arg Pro ALa Trp Glu Ala Arg Trp Pro Val 

9 9 5 L j J 0 1 J D 5 

CGG ACC GGG CAC TGC GGG AGG CAC CTC TCG GCC TC2 GAG CGG CCC CTG 3^54 

Arg Thr GLy His Cys GLy Arg His Lej. Ser ALa Ser Gl a Arg Pro Leu 

20 L0L0 1JL5 L320 

TCG CCC GCG CGC TGT CAC TAG AGC T:C TTT CCT CGA GCC GAC CGA TCC 3 3 02 

Ser Pro Ala Arg Cys His Tyr Ser Ser Phe Pro Arg Ala Asp Arg Ser 

L 0 * 5 L J 3 0 1035 

25 

GGC CGC CCC TTC CTC CCG CTC TTC CCG GAG CCC CCG GAG CTG GAG GAC 3 3 50 

Gly Arg Pro Phe Le j Pro Leu Phe Pro Glu Pro PrD Gl j Leu Gl: Asp 
L340 L045 1350 

30 CTG CCG CTG CTC GGT CCG GAG CAG CTG GCC CGG CGG GAG GCC CTG CTG 3 3 98 

Leu ?r3 Leu Leu Gly Pro Glu Gin Leu Ala Arg Arg 31 u Ala Leu Le ^ 

::55 1:5: ::65 

CAC GC j GCC TGG GCC :GG GG : TCG CGI CCG CGL CAC GCT TCC CTG CC2 3 446 

35 His Ala Ala Trp Ala Arg Gly Ser Arc Pro Arg His Ala Ser Leu Pro 
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AGC TCC GTG GCC GAG GCC TTC GCT CGG CCC AGO TCG CTG CCC GCC GGG 
Ser Ser Val Ala Glu Ala ?he Ala Arc Pro Ser Ser Leu Pro Ala Gly 



TGC ACG GGC GCC GCC TGC GCC CGC CCC GAC GGA CAC TCG GCC TGC AGG 
Cys Thr Gly Pro Ala Cys Ala Arc Pro Asp Gly His Ser Ala Cys Arg 



3542 



CGC TTG GCG GAG GCG CAG TCG ATG TGC TTG CCG ATC TAG CGG GAG GCC 
Arg Leu A La Gin Ala Gin Ser Met Cys Leo Pro lie Tyr Arg Glu Ala 



1120 



1125 



1130 



3590 



TGC CAG GAG GGC GAG CAG GCA GGG GCC CCC GCC TGG CAG CAC AG A CAG 
Cys Gin Glu Gly Glu Gin Ala Gly Ala Pro Ala Trp Gin His Arg Gin 



113 5 



1140 



1145 



L15C 



3638 



CAC GTG TGC CTG CAC GCC CAC GCC CAC CTG GGA TTT TGC TG 



GGG GCT 



His Val Cys Leu His Ala Hi 
1155 



Ala His Leu Pro Phe Cys Trp Gly Ala 



1160 



1165 



3686 



GTC TGT CCT CAC CTT CCA C 



GCC AGC CAC GGC TCC TG 



:tc TCC 



Val Cys Pro His Leu Pro Pro Cys Ala Ser His Gly Ser Trp Leu Ser 



1170 



1175 



1130 



3734 



GGG GCC TGG GGG CCT CTG GGG CAC AGG GGC AGG ACT CTG GGG GTG GG G 
Gly Ala Trp Gly Pro Leu Gly His Arg Gly Arg Thr Leu Gly Leu Gly 



1 135 



1195 



3782 



AC A GGC TAG AGA GAC AGT GGG GGA CTG GAC 
Thr Gly Tyr Arg Asp Ser Gly Gly Leu Asp 



3 A j ATC AGC AGG GTA GCC 
31 i He Ser Arg Val Ala 



3830 



GGG ACG GAA GGC TTC CCG GGA CCC . G Z AGC .GG AGA CGG ATC GCC 
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Ala Tip Ala Gly Leu Gly Pro Gly Gin Gly Glu Glr. Gly Me: .hr Val 

5 Ala Val Val Phe Ser Ser Ser Gly Pro Pro Glr Ala Gin Phe Arg Val 
3 5 4 0 4 5 

Arg Leu Thr Pro Gin Ser Phe Leu Asp Leu Pro Leu Glu lie Gin Pro 

5 0 5 5 6 0 

10 

Leu Thr Val Gly Val Asn Thr Thr Asn Pro Ser Ser Leu Leu Thr Gin 

65 70 75 80 

lie Cys Gly Leu Leu Gly Ala Ala His Val His Gly lie Val Phe Glu 
15 85 90 95 

Asp Asn Val Asp Thr Glu Ala Val Ala Gin lie Leu Asp Phe lie Ser 

10 0 10 5 110 

20 Ser Gin Thr His Val Pro lie Leu Ser lie Ser Gly Gly Ser Ala Val 
115 120 125 

Val Leu Thr Pro Lys Glu Pro Gly Ser Ala Phe Leu Gin Leu Gly Val 
130 135 140 

25 

Ser Leu Glu Gin Gin Leu Gin Val Leu Phe Lys Val Leu Glu Glu Tyr 
145 150 155 160 

Asp Trp Ser Ala Phe Ala Val lie Thr Ser Leu His Pro Gly His Ala 
30 165 170 175 

Leu Phe Leu Glu Gly Val Arg Ala Val Ala Asp Ala Ser His Val Ser 
ISO 155 19 1 

35 Trp Arg Leu Leu Asp Val Val Thr Leu Glu Leu Asp Pro Gly Gly Pro 
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Arc Ala Arg Thr Gin Arg leu Leu Arc Gin Leu Asp Ala Pro Val Phe 



Val Ala Tyr Cys Ser Arg Glu Glu Ala Glu Val Leu ?he Ala Glu Ala 

225 230 235 24? 

Ala Gin Ala Gly Leu Val Gly Pro Gly His Val Trp Leu Val Pro Asn 
10 245 250 255 

Leu Ala Leu Gly Ser Thr Asp Ala Pro Pro Ala Thr Phe Pro Val Gly 
260 265 270 

15 Leu lie Ser Val Val Thr Glu Ser Trp Arg Leu Ser Leu Arg Gin Lys 
275 280 285 

Val Arg Asp Gly Val Ala lie Leu Ala Leu Gly Ala His Ser Tyr Trp 

290 295 300 

20 

Arg Gin His Gly Thr Leu Pro Ala Pro Ala Gly Asp Cys Arg Val His 
305 310 315 32 0 

Pro Gly Pro Val Ser Pro Ala Arg Glu Ala Phe Tyr Arg His Leu Leu 
25 325 330 335 

Asn Val Thr Trp Glu Gly Arg Asp Phe Ser Phe Ser Pro Gly Gly Tyr 
340 345 350 

30 Leu Val Gin Pro Thr Met Val Val lie Ala Leu Asn Arg His Arg Leu 
3 5 5 3 6 0 36 5 

Trp Glu Met Val Gly Arg Trp Glu His Gly Val Leu Tyr Met Lys Tyr 

371 375 351 

35 
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Glu Ser Pro Asp Pro Gly Thr Gly Gly Cys Val Pro Asn Thr Val Pro 
420 425 430 



Cys Arg Arg Gin 
435 

Tyr Thr Lys Leu 
450 

Leu Ala Arg Val 
465 

Gly Lys His Gly 

Glu Val Tyr Tyr 

500 

Asn Glu Glu Arg 
515 

Thr Gly lie Ser 

530 

Ser Ala Phe Leu 
54 5 

Val Met Cys Leu 



Ser Asn His Thr 
44 0 

Cys Cys Lys Gly 
455 

Val Lys Phe Ser 
470 

Lys Arg Val Arg 
485 

Lys Arg Ala Asp 

Ser Glu lie Val 

520 

Val Met Val Ala 

535 

Glu Pro Tyr Ser 

5 5 0 

Thr Val Val Ala 

5 6 5 



Phe Ser Ser Gly 

Phe Cys lie Asp 
460 

Tyr Asp Leu Tyr 
475 

Gly Val Trp Asn 
490 

Met Ala lie Gly 

505 

Asp Phe Ser Val 

Arg Ser Asn Gly 

54 0 

Pro Ala Val Trp 

He Thr Val Phe 

57 0 



Asp Val Ala Pro 
445 

lie Leu Lys Lys 

Leu Val Thr Asn 

48C 

Gly Met lie Gly 
495 

Ser Leu Thr lie 
510 

Pro Phe Val Glu 
525 

Thr Val Ser Pro 

Val Met Met Phe 

5 6 0 

Met Phe Glu Tyr 
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Phe Ser Pro Val Ser Tyr A sr. Glr. Asr. leu 7hr Arg Gly Lys lys Ser 



5 Gly Gly Pro Ala Phe Thr He Gly Lys Ser Val Trp leu leu 7rp Ala 

5 Q ^ 6 C 0 6 2 5 



leu Val Phe A sr. A sr. Ser Val Pro He Glu Asn Pro Arg Gly Thr Thr 

610 615 620 

10 

Ser Lys He Met Val Leu Val Trp Ala Phe Phe Ala Val He Phe Leu 
625 630 635 640 

Ala Arg Tyr Thr Ala Asn Leu Ala Ala Phe Met He Gin Glu Gin Tyr 
15 645 650 655 

He Asp Thr Val Ser Gly Leu Ser Asp Lys Lys Phe Gin Arg Pro Gin 

660 665 670 

20 Asp Gin Tyr Pro Pro Phe Arg Phe Gly Thr Val Pro Asn Gly Ser Thr 

675 580 685 

Glu Arg Asn He Arg Ser Asn Tyr Arg Asp Met His Thr His Met Val 

690 695 700 

25 

Lys Phe Asn Gin Arg Ser Val Glu Asp Ala Leu Thr Ser Leu Lys Met 

70 5 710 715 720 

Gly Lys Leu Asp Ala Phe lie Tyr Asp Ala Ala Val Leu Asn Tyr Met 
30 725 730 735 

Ala Gly Lys Asp Glu Gly Cys Lys Leu Val Thr He Gly Ser Gly Lys 

741 745 75: 

35 Val Phe Ala Thr Thr Gly Tyr Gly He Ala Met Glr. Lys Asp Ser His 
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Trp Lys Arg Ala He Asp Leu Ala Leu Leu Gin ?he Leu Gly Asp Gly 

771 775 7 S G 

5 

Glu Thr Gin Lys Leu Glu Thr Val Trp Leu Ser Gly He Cys Gin Asr. 

755 790 7?5 SZZ 

Glu Lys Asr. Glu Val Met Ser Ser Lys Leu Asp He Asp Asr. Met Ala 
10 805 810 815 

Gly Val Phe Tyr Met Leu Leu Val Ala Met Gly Leu Ala Leu Leu Val 
820 825 830 

15 Phe Ala Trp Glu His Leu Val Tyr Trp Lys Leu Arg His Ser Val Pro 

835 840 845 

Asr Ser Ser Gin Leu Asp Phe Leu Leu Ala Phe Ser Arg Gly He Tyr 
850 855 860 

20 

Ser Cys Phe Ser Gly Val Gin Ser Leu Ala Ser Pro Pro Arg Gin Ala 

865 870 875 880 

Ser Pro Asp Leu Thr Ala Ser Ser Ala Gin Ala Ser Val Leu Lys Met 
25 885 890 895 

Leu Gin Ala Ala Arg Asp Met Val Thr Thr Ala Gly Val Ser Ser Ser 

900 905 910 

30 Leu Asp Arg Ala Thr Arg Thr He Glu Asr Trp Gly Gly Gly Arg Arg 

915 92 0 92 5 

Ala Pro Pro Pro Ser Pro Cys Pro Thr Pro Arg Ser Gly Pro Ser Pro 
931 935 941 

35 
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Cys leu Pro Thr Pro Asp Pro Pro Pro Glu Pro Ser Pro Thr Gly Trp 



Gly Pro Pro Asp Gly Gly Arg Ala Ala Leu Val Arg Arg Ala Pro Gin 
5 965 971 975 

Pro Pro Gly Arc Pro Pro Thr Pro Gly Pro Pro Leu Ser Asp Val Ser 

98: 38 = 990 

10 Arg Val Ser Arg Arg Pro Ala Trp Glu Ala Arg Trp Pro Val Arg Thr 

995 10C0 1QG5 

Gly His Cys Gly Arg His Leu Ser Ala Ser Glu Arg Pro Leu Ser Pro 
1010 1015 102 0 

15 

Ala Arg Cys His Tyr Ser Ser Phe Pro Arg Ala Asp Arg Ser Gly Arg 

1025 1030 1035 1040 

Pro Phe Leu Pro Leu Phe Pro Glu Pro Pro Glu Leu Glu Asp Leu Pro 
20 1045 1050 1055 

Leu Leu Gly Pro Glu Gin Leu Ala Arg Arg Glu Ala Leu Leu His Ala 
1060 1065 1070 

25 Ala Trp Ala Arg Gly Ser Arg Pro Arg His Ala Ser Leu Pro Ser Ser 

10 7 5 10 8 0 1 C 8 5 

Val Ala Glu Ala Phe Ala Arg Pro Ser Ser Leu Pro Ala Gly Cys Thr 
1090 1095 1100 



30 



Gly Pro Ala Cys Ala Arg Pro Asp Gly His Ser Ala Cys Arg Arg Le_ 



35 



Ala Glr. Ala Glr. Ser Met: Cys leu Pre :le Tyr Arc Glu Ala Cys Glr. 
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Glu Sly Glu Gin Ala Gly Ala Pro Ala Trp Glr. His Arg Gin His Val 



5 Cys Leu His Ala His Ala His Leu Pro Phe Cys Trp Gly Ala Val Cys 
1155 1160 1165 

Pro His Leu Pro Pro Cys Ala Ser His Gly Ser Trp Leu Ser Gly Ala 
1170 1175 11SC 

10 

Trp Gly Pro Leu Gly His Arg Gly Arg Thr Leu Gly Leu Gly Thr Gly 
1185 1190 1195 1200 

Tyr Arg Asp Ser Gly Gly Leu Asp Glu lie Ser Arg Val Ala Arg Gly 
15 1205 1210 1215 

Thr Gin Gly Phe Pro Gly Pro Cys Thr Trp Arg Arg lie Ser Ser Leu 
1220 1225 1230 

20 Glu Ser Glu Val 

1235 



25 



(2) INFORMATION FOR SEQ ID NO : 7 : 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDSDNESS : both 
30 (D) TOPOLOGY: both 

(n) MOLECULE TYPE: c ON A 

(ix) FEATVRE: 
35 (A) NAME KEY: CDS 
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20 



(3) LOCATION: 



;xi! SEQUENCE DESCRIPTION: SLQ 10 NO:' 

5 C TCT GAG GCT CAG CCT GTC CCC AG 

Ser Glu Ala Gin Pro Val Pro 



10 (2) INFORMATION FO? SEQ ID NO : 8 : 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: linear 

(n) MOLECULE TYPE: protein 



(xi ) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Ser Glu Ala Gin Pro Val Pro 
1 E 



25 (2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRAJOEDNESS : unknown 

(D) TOPOLOGY : unknown 

(:i) MOLECULE TYPE : cDNA 
35 (xi i SEQUENCE DESCRIPTION: SEQ ID NO : =■ : 
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{2} INFORMATION FOR SEQ ID NO : 1 Z : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 30 8 base pairs 
(3) TYPE: nucleic acid 
{ C ) STRANDEDNESS : both 
(D) TOPOLOGY: both 

(li) MOLECULE TYPE : cDNA 

(IX) FEATURE: 

(A) NAME 'KEY: CDS 

(B) LOCATION : 311 . .4705 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO : 1 0 : 

ATCATGGGAC CGGGTGAGCG CTGAGAATCG CGGCCGCAGC CATCAGCCCT GGAGATGACC 

AGGAGCGGCC ACTGCTGAGA ACTATGTGGA GAGAGGCTGC GAGCCCTGCT GCAGAGCCTC 

CGGCTGGGAT AGCCGCCCCC CGTGGGGGCG ATGCGGACAG CGCGGGACAG CCAGGGGAGC 

GCGCTGGGGC CGCAGCATGC GGGAACCCGC TAAACCCGGT GGCTGCTGAG GCGGCCGAGA 

TGCTCGTGCG CGCAGCGCGC CCCACTGCAT CCTCGACCTT CTCGGGCTAC AGGGACCGTC 

AGTGGCGACT ATG GGC AGA GTG G3C TAT TGG ACC CTG CTG GTG CTG CCG 
yet Gly Arg Val Gly Tyr Trp Thr Leu Leu Val Leu Pro 

Gil CTT CTG GTC TGG CGI GGT CCG GCG CCG AGC GCG GIG GIG GAG AAG 
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■ ex- - - y 



3 tr r A ^ a ."i _ a .-. - c 



CCC CCC GCG CTA AAT ATT GCG 



jGT AAC AGG CAC GAC 



Gly Pro Pro Ala Leu Asn lie Ala VaL Met Leu Gly His Ser His Asp 



3 : 



4 : 



44 5 



10 



jTG AGA GAG G jC GAA CTT GGA ACA CTG TGG GGC ZCC 
Val Thr Glu Arg Glu Leu Arg Thr Lei Trp Gly Pro 

5 0 5 5 



j AG CAG GGG GCG 
Glu Gin Ala Ala 

6 1 



493 



15 



3GG CTG CCC CTG GAC GTG AAC GTG GTA GCT GTG GTG ATG AAG CGC ACC 
3ly Leu Pro Leu Asp Val Asn Val Val Ala Leu Leu Met Asn Arg Thr 

65 70 75 



SAC CCC AAG AC 



:TC ATC ACG CAC GTG TGG GAC CTC ATG TCC GGG GCA 



Asp Pro Lys Ser Leu lie Thr His Val Cys Asp Leu Me: Ser Gly Ala 

83 35 9: 



541 



589 



20 CGC ATC CAC GGG CTC GTG TTT GGG GAG GAC ACG GAC CAG GAG GCC GTA 
Arg lie His Gly Leu Val Phe Gly Asp Asp Thr Asp Gin Glu Ala Val 

9 5 100 135 



637 



GCC CAG ATG CTG GAT TTT ATC TCC TCC CAC ACC TTC GTC CCC ATC TTG 
25 Ala Gin Met Leu Asp Phe lie Ser Ser His Thr Phe Val Pro He Leu 

111' 115 12j 125 



685 



30 



GGC ATT CAT GGG GGC GCA TCT ATG ATC ATG GCT GAC AAG GAT CCG ACG 
Gly He His Gly Gly Ala Ser Met lie Met Ala Asp Lys Asp Pro Thr 



13 D 



135 



140 



733 



35 



ACC TTC TTC CAG TTT GGA GCG TCC ATC CAG CAG CAA GCC ACG 
Thr Pre ?ne Gin Phe Gly Ala Ser lie '"Jin Gin Gin Ala Thr 
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_eu Lys He Met G.r. Asp 



is Val Phe Ser Leu Val 



ACC ACT ATC 



GGC TAG AGG GAA TTC ATC AGC TTG GTG AAG ACC 



Pne ?r3 Gly Tyr Arg 

is: 



Phe 1H 



Ser Phe Val Lys Thr 



10 



AC A GTG GAG AAC AGC TTT GTG GGC TGG GAG ATG CAG AAT GTG ATC AC A 
Thr Val Asp Asn Ser Phe Val Gly Trp Asp Mez Gin Asn Val lie Thr 



19 0 



195 



200 



J D 5 



925 



15 



CTG GAG ACT TCC TTT GAG GAT GCA AAG ACA CAA GTC CAG CTG AAG AAG 
Leu Asp Thr Ser Phe Glu Asp Ala Lys Thr Gin Val Gin Leu Lys Lys 



213 



215 



220 



973 



20 



ATC CAC TCT TTT GTC ATC TTG CTC TAG TGT TCC AAA GAG GAG GCT GTT 
lie His Ser Ser Val lie Leu Leu Tyr Cys Ser Lys Asp Glu Ala Val 



225 



233 



235 



CTC ATT CTG AGT GAG GCC CGC TCC CTT GGC CTC ACC GGG TAT GAT TTC 



Leu He Leu Ser Glu Ala Arg Ser Leu 



24 0 



245 



Leu Thr Gly Tyr Asp Phe 
25 j 



1C21 



1C69 



25 TTC TGG ATT GTC CCC AGC TTG GTC TCT GGG AAG AGG GAG CTC ATC CCA 

Phe Trp He Val Pro Ser Leu Val Ser Gly Asn Thr Glu Leu He Pro 



255 



261 



265 



1117 



AAA GAG TTT CCA 
30 Lys Glu Phe Pro 

270 



3GA CTC ATT TCT 
Sly Leu 



rT TCT GTC TCC TAG GAT GAG TGG GA'I 



275 



Ser Val Ser Tyr Asp Asp Trp Asp 

2 8 0 2 3 5 



1165 



35 



TAC AGC CTG GAG GCG AG A GTG AGG GAG GGC ATT GGC ATC CTA ACC ACC 
Tyr Ser Leu Glu Ala Arg Val Arg Asp Gly He Gly lie Leu Thr Thr 

2 9 0 2 9 5 3 : : 
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GCT GCA TCT TCT ATG 3TG GAG AAG TTC TCC TAG ATC GGG GA3 GCC AAG 126 1 

Ala Ala Ser Ser Me: le; Glu Lys Phe Ser Tyr He Pro Gil Ala Lys 

3C5 31 :■ 315 

5 

GCC AGC TGC TAT GGG GAG ATG GAG AGG GCA GAG GTC GGG AT 3 CAC AGG 13 39 

Ala Ser Gys Tyr Gly Gin Met Gli Arg ?:d Glu Val Pro Met His Thr 

323 325 330 

10 TTG GAG GCA TTT ATG GTC AAT GTT ACA TGG GAT GGC AAA GAG TTA TCC 13 57 

leu His Pro Phe Met Val Asn Val Thr Trp Asp Gly Lys Asp Leu Ser 

335 343 345 

TTC ACT GAG GAA GGC TAG CAG GTG CAC CCC AGG CTG GTG GTG ATT GTG 1405 

15 Phe Thr Glu Glu Gly Tyr Gin Val His Pro Arg Leu Val Val He Val 
35 0 355 36 0 365 

CTG AAC AAA GAG CGG GAA TGG GAA AAG GTG GGC AAG TGG GAG AAC CAT 14 53 

Leu Asn Lys Asp Arg Glu Trp Glu Lys Val Gly Lys Trp Gl a Asn His 
20 373 375 383 

AC "3 CTG AGC CTG AGG CAC GCC GTG TGG CCC AGG TAC AAG TCC TTC TCC 1501 

Thr Lei Ser Leu Arg His Ala Val Trp Pro Arg Tyr Lys Ser Phe Ser 

385 393 395 

25 

GAG TGT GAG CCG GAT GAG AAC CAT CTC AGC ATG GTC ACC CTG GAG GAG 154 9 

Asp Cys Glu Pro Asp Asp Asn His Leu Ser lie Val Thr Leu Glu Glu 
4 0 3 4 0 5 4 10 

30 GCC CCA TTC GTC ATG GTG GAA GAC ATA GAC CCC CTG ACC GAG ACG TGT 15 97 

Ala PrD Phe Val He Val Glu Asp He Asp Pro Leu Thr Glu Thr Cys 

4 15 4 2 3 4 2 5 

GTG AGG AAC ACC GTG CCA TGT CGG AAG TTC GTC AAA ATC AAC AAT TCA 164 5 

35 Val Arg Asn Thr Val Pro Cys Arc Lys Phe Val Lys He Asn Asn Ser 
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ACC AAT GAG GGG ATG AAT GTG AAG AAA TGC TGC AAG GGG TTC TGC ATT 16 93 

Thr Asr. Glu Gly Met A sr. Va 1 Lys Lys Cys Cys Lys Gly Phe Cys He 

5 45: 455 46C 

GAT ATT CTG AAG AAG CTT TCC AGA ACT GTG AAG TTT ACT TAG GAC CTC 1741 

Asp lie Leu Lys Lys Leu Ser Arg Thr Va 1 Lys ?he Tr.r Tyr Asp Leu 

465 475 475 

10 

TAT CTG GTG ACC AAT GGG AAG CAT GGC .AAG AAA GTT AAC AAT GTG TGG 1789 

Tyr Leu Va 1 Thr Asr Gly Lys His Gly Lys Lys Val Asn Asn Val Trp 

4 8 J 4 3 5 4 9 0 

15 AAT GGA ATG ATC GGT GAA GTG GTC TAT CAA CGG GCA GTC ATG GCA GTT 183 7 

Asn Gly Me: He Gly Glu Val Val Tyr Gin Arg Ala Val Met Ala Val 

495 503 505 

GGC TCG CTC ACC ATC .AAT GAG G.AA CGT TCT GAA GTG GTG GAC TTC TCT 188 5 

20 Gly Ser Leu Thr He Asn Glu Glu Arg Ser Glu Val Val Asp Phe Ser 

510 515 520 525 

GTG CCC TTT GTG GAA ACG GGA ATC AGT GTC ATG GTT TCA AGA AGT AAT 193 3 

Val Pro Phe Val Glu Thr Gly lie Ser Val Met Val Ser Arg Ser Asn 

25 530 535 540 

GGC ACC GTC TCA CCT TCT GCT TTT CTA GAA CCA TTC AGC GCC TCT GTC 1981 

Gly Thr Val Ser Pro Ser Ala Phe Leu Glu Pro Phe Ser Ala Ser Val 

545 550 555 

30 

TGG GTG ATG ATG TTT GTG ATG CTG CTC ATT GTT TCT GCC ATA GCT GTT 2 02 9 

Trp Val Met Met Phe Val Met Leu Leu lie Val Ser Ala lie Ala Val 



35 TGG GTC TTG GAT TAG TCC AGO CCT GTT GGA TAC AAC AGA AAC TTA GCC 2 0 77 
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\sp Tyr Ser Ser 



3-y As: 



AAA GGG AAA GCA CCC CAT GGG CCT TCT TTT AC A ATT GGA AAA GCT ATA 2125 

5 Lys Gly Lys Ala Pro His Giy Pro Ser ?he Thr He Gly Lys Ala He 

593 595 6 : C 605 

TGG CTT CTT TGG GGC CTG GTG TTC AAT AAC T ZC GTG CCT GTC CAG AAT 2173 

Trp Leu leu Trp Gly Leu Val Phe Asn Asn Ser Val Pro Val Gin Asn 

10 612 515 623 

CCT AAA GGG ACC ACC AGC AAG ATC ATG GTA TCT GTA TGG GCC TTC TTC 2 221 

Pro Lys Gly Thr T'nr Ser Lys He Met Val Ser Val Trp Ala Phe Phe 

625 630 635 

15 

GCT GTC ATA TTC CTG GCT AGC TAG ACA GCC AAT CTG GCT GCC TTC ATG 22 6 9 

Ala Val He Phe Leu Ala Ser Tyr Thr Ala Asn Leu Ala Ala Phe Met 

64 3 645 65 2 

20 ATC CAA GAG GAA TTT GTG GAC CA^ GTG ACC GGC CTC AGT GAC AAA AAG 2317 

He Gin Glu Glu Phe Val Asp Gin Val Thr Gly Leu Ser Asp Lys Lys 

655 650 665 

TTT CAG AGA CCT CAT GAC TAT TCC CCA CCT TTT CGA TTT GGG ACA GTG 23 65 

25 Phe Gin Arg Pro His Asp Tyr Ser Pro Pro Phe Arg Phe Gly Thr Val 

673 575 530 685 

CCT AAT GGA AGC ACG GAG AGA AAC ATT CGG AAT AAC TAT CCC TAG ATG 2413 

Pro Asn Gly Ser Thr Glu Arg Asn He Arg Asn Asn Tyr Pro Tyr Met 

30 693 695 723 

CAT CAG TAG ATG ACC AAA TTT AAT CAG AAA GGA GTA 3 AG GAC GCC TTG 24 61 

His Gin Tyr Met Thr Lys Phe Asn Gin Lys Gly Val Glu Asp Ala Leu 



35 
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GTG TTG AAT TAC AAG GCT GGG AGG GAT GAA GGC TGC AAG CTG GTG ACC 2 5 57 

Val Leu Asn Tyr Lys Ala Gly Arg Asp Glu Gly Cys Lys leu Val Thr 

7 3 5 7 4: 7-15 

ATG GGG AGT GGG TAC ATG TTT GGC ACC ACC GGT TAT GGA ATT GCC CTT 2 6 7 5 

He Gly Ser Gly Tyr Tie Phe Ala Thr Thr Gly Tyr Gly He Ala Leu 

753 755 763 765 

GAG AAA GGC TCT CCT TGG AAG AGG CAG ATC GAG CTG GCC TTG CTT CAG 2653 

Gin Lys Gly Ser Pro Trp Lys Arg Gin lie Asp Leu Ala Leu Leu Gin 

773 775 783 

TTT GTG GGT GAT GGT GAG ATG GAG GAG CTG GAG ACC CTG TGG CTG ACT 2 701 

Phe Val Gly Asp Gly Glu Met Glu Glu Leu Glu Thr Leu Trp Leu Thr 

785 793 795 

GGG ATC TGC CAG AAC GAG AAG AAC GAG GTG ATG AGO AGC CAG CTG GAG 2 74 9 

Gly He Cys His Asn Glu Lys Asn Glu Val Met Ser Ser Gin Lea Asp 

8 3 3 3 35 3 13 

ATT GAC AAC ATG GGG GGC GTA TTC TAC ATG CTG GCT GCC GCC ATG GCC 27 97 

lie Asp Asn Met Ala Gly Val Phe Tyr Met Leu Ala Ala Ala Met Ala 

815 823 3 25 

CTT AGC CTC ATC ACC TTC ATC TGG GAG CA3 CTG TTC TAC TGG AAG CTG 2 845 

Lei Ser Leu He Thr Phe lie Trp Glu His Le ^ Phe Tyr Trp Lys Leu 

333 335 841 845 



\rg Phe Cys Phe Thr 31 y Val Cys Ser Asp Arg Pro Gly Leu Leu Phe 
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TCC ATG AGC AGG GGC ATG TAG AGG TGG ATT CAT GGA GTG CAG ATT GAA 
Ser 1 1 -2 Ser Arc Gly He Tyr Ser Cys He His Gly Va 1 His He Glu 



3:: 



GAA AAG AAG AAG TCT CCA GAG TTG AAT GTG ACG GGA TGG GAG AGC AAC 2 559 

Glu Tys Tys Lys Ser Pro Asp ?he Asn Leu Thr Gly Ser Gin Ser A sr. 

S S 1 6 8 5 8 ? : 

10 ATG TTA AAA CTG GTC CGG TCA GCG AAA AAC ATT TCC AGG ATG TGG AAG 3C37 

Met: Leu Lys Leu Leu Arg Ser Ala Lys Asn He Ser Ser Met Ser Asn 

895 903 935 

ATG AAG TCC TGA AGA ATG GAG TCA CCG .AAA AGA GGT GGT GAG TTG ATG 3C85 

15 Met: Asn Ser Ser Arg Met Asp Ser Pro Lys Arg Ala Ala Asp Phe He 

913 915 920 925 

GAA AGA GGT TGG GTC ATG ATG GAG ATG GTT TCA GAT AAG GGG AAT TTG 3133 

Gin Arg Gly Ser Leu He Met Asp Met Val Ser Asp Lys Gly Asn Leu 

20 930 935 943 

ATG TAG TCA GAG AAC AGG TCC TTT CAG GGG AAA GAG AGG ATT TTT GGA 3181 

Me: Tyr Ser Asp Asn Arg Ser Phe Gin Gly Lys Glu Ser He Phe Gly 
945 953 955 

25 

GAG AAG ATG AAC GAA CTG GAA ACA TTT GTG GCC AAG CGG GAG AAG GAT 3 22 9 

Asp Asn Met Asn Glu Leu Gin Thr Phe Val Ala Asn Arg Gin Lys Asp 

953 965 970 

30 AAC CTG AAT AAG TAT GTA TTG GAG GGA GAA CAT GGT CTT ACT CTG AAT 3 277 

Asn Leu Asn Asn Tyr Val Phe Gin Gly Gin His Pro Leu Thr Leu Asn 

975 983 935 

j AG TCG .AAG CCT AAC ACG GTG GAG GTG 3CC GTG AGC AGA GAA TCC AAA 3 3 2 5 

35 Glu Ser Asn Pro Asn Thr Val Glu Val Ala Val Ser Thr Glu Ser Lys 
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10 



GCG AAC TCT AG A CCC CGG CAG CTG TGG AAG AAA TCC GTG GAT TCC ATA 
Ala A sr. Ser Arg Pro Arc Glr Leu Trp Lys Lys Ser Va 1 Asp Ser He 



CGC GAG GAT TCA GTA TGG CAG AAT CCA GTG TGC CAG AGG GAT GAG GCA 3421 

Arg Glr Asp Ser Leu Ser Glr Asr Pro Val Ser Glr Arg Asp Glu Ala 

1025 1 3 C 103 5 

AC A GCA GAG AAT AGG ACC CAC TCC CTA AAG AGC GCT AGG TAT GTT CCA 34 69 

Thr Ala Glu Asn Arg Thr His Ser Leu Lys Ser Pro Arg Tyr Leu Pro 

134 0 1045 10 50 



15 GAA GAG ATG GCC GAC TCT GAC ATT TCA GAA ACG TCA AAT CGG GCC ACG 3 517 

Glu Glu Met Ala His Ser Asp He Ser Glu Thr Ser Asr Arg Ala Thr 
1055 1060 1065 

TGG CAC AGG GAA GCT GAG AAC AGT AAG AAC CAC AAA ACC AAG GAC AAC 3 5 65 

20 Cys His Arg Glu Pro Asp Asn Ser Lys Asn His Lys Thr Lys Asp Asn 
1070 1075 1080 1035 

TTT AAA AGG TCA GTG GCC TCC AAA TAC CCC AAG GAC TGT AGT GAG GTG 3613 

Phe Lys Arg Ser Val Ala Ser Lys Tyr Pro Lys Asp Cys Ser Glu Val 

25 1090 1395 1100 

GAj CGC ACC TAC GTG AAA ACC AAA TCA AGC TCC OCT AGA GAC .AAG ATC 3 661 

■31 j Arg Thr Tyr Leu Lys Thr Lys Ser Ser Ser Pro Arg Asp Lys He 

110 5 1110 1115 



30 



TA0 ACT ATA GAT GGT GAG AAG GAG OCT GGT TTC CAC TTA GAT CCA CCC 3 709 

Tyr Thr He Asp Gly Glu Lys Glu Pro Gly Phe His Leu Asp Pro Pro 



35 OAG TTT 3TT GAA AAT GTG ACC CTG C 0C GAG AAC GTG GAC TTC CCG GAC 
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Thr Leu Pre 31- Asr. Val Asp Pr.e Pre Asp 



CCC TAC CAG GAT CCC AGT G.AA AAC TTC CGC AAG GGG GAC TCC ACG CTG 3S15 

5 Pro Tyr Gin Asp Pro Ser Glu Asr. Phe Arg Lys Gly Asp Ser Thr leu 

1150 1155 1161 1165 

CCA ATG AAC CGG AAC CCC CTG CAT AAT GAA GAG GGG CTT TCC AAC AAC 3S53 

Pro Me: Asr. Arg Asr Pro Leu His Asr Glu Glu Gly leu Ser Asr. Asr. 

10 1170 1175 113C 

GAC CAG TAT AAA CTC TAC TCC AAG CAC TTC ACC TTG AAA GAC AAG GGT 3 901 

Asp Gin Tyr Lys Leu Tyr Ser Lys His Phe Thr Leu Lys Asp Lys Gly 
1185 1190 1195 

15 

TCC CCG CAC AGT GAG ACC AGC GAG CGA TAC CGG CAG AAC TCC ACG CAC 3 94 9 

Ser Pro His Ser Glu Thr Ser Glu Arg Tyr Arg Gin Asn Ser Thr His 
12D0 1205 1210 

20 TGC AGA AGC TGC CTT TCC AAC ATG CCC ACC TAT TCA GGC CAC TTC ACC 3 9 97 

Cys Arg Ser Cys Lej Ser Asn Met Pro Thr Tyr Ser Gly His Phe Thr 
1215 1220 1225 

ATG AGG TCC CCC TTC AAG TGC GAT GCC TGC CTG CGG ATG GGG AAC CTC 4 04 5 

25 Met Arg Ser Pro Phe Lys Cys Asp Ala Cys Leu Arg Met Gly Asn Leu 

1230 1235 1240 1245 

TAT GAC ATC GAT GAA GAC CAG ATG CTT CAG GAG ACA GGT AAC CCA GCC 4093 

Tyr Asp lie Asp Glu Asp Gin Met Leu Gin Glu rhr Gly Asn PrD Ala 

30 1250 1255 1260 

ACC GGG GAG CAG GTC TA0 CAG CAG GAC TGG OCA CAC AAC AAT GCC CTT 414 1 

Thr Gly Glu Gin Val Tyr Gin Gin Asp Trp Ala Gin Asn Asr. Ala Leu 



35 
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Gin leu Glr. Lys A sr. Lys Leu Arg He Ser Arg Gin His Ser Tyr Asp 



5 AAC ATT GTC GAC AAA CCT AGG GAG CTA GAC CTT AGO AGG CCC TCC CGG 423 7 

Asn He Val Asp Lys Pro Arg Glu Leu Asp Leu Ser Arg Pro Ser Arg 

12 9 5 1 3 1 j 1315 

AGG ATA AGG CTG AAG GAG AGG GAA CGG CTT :TG GAG GGA AAT TTT TAC 42S5 

10 Ser He Ser Leu Lys Asp Arg Glu Arg Leu Leu Glu Gly Asn Phe Tyr 

1310 1315 1320 1325 

GGC AGC CTG TTT AGT GTC CCC TCA AGC AAA CTC TCG GGG AAA AAA AGC 4333 

Gly Ser Leu Phe Ser Val Pro Ser Ser Lys Leu Ser Gly Lys Lys Ser 

15 1330 1335 1340 

TCC CTT TTC CCC CAA GGT CTG GAG GAC AGC AAG AGG AGC AAG TCT CTC 4 3 81 

Ser Leu Phe Pro Gin Gly Leu Glu Asp Ser Lys Arg Ser Lys Ser Leu 

1345 135 0 13 5 5 

20 

TTG CCA GAC CAC ACC TCC GAT AAC CCT TTC CTC CAC TCC CAC AGG GAT 4 42 9 

Lea Pro Asp His Thr Ser Asp Asn Pro Phe Leu His Ser His Arg Asp 

1350 1365 1370 

25 GAC CAA CGC TTG GTT ATT GGG AGA TGC CCC TCG GAC CCT TAC AAA CAC 44 77 

Asp Gin Arg Leu Val He Gly Arg Cys Pro Ser Asp Pro Tyr Lys His 

1375 1380 1335 

TCG TTG CCA TCC CAG GCG GTG AAT GAC AGC TAT CTT CGG TCG TCC TTG 4 52 5 

30 Ser Leu Pro Ser Gin Ala Val Asn Asp Ser Tyr lei Arg Ser Ser Leu 

1390 13 95 14H 1415 

AGG TCA AC 3 GCA TCG TA 2 TGT ICC AGG GA 2 AGT 2G2 GGC CAC AAT GAT 45 73 

Arg Ser Thr Ala Ser Tyr Cys Ser Arg Asp Ser Ars Gly His Asn Asp 



35 
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5 



GTG TAT ATT TCG GAG CAT GTT ATG CCT TAT GCT GCA AAT AAG AAT AAT 4621 

Val Tyr lie Ser Glu His Val Me: Pro Tyr Ala Ala Asr. Lys A sr. Asn 
14 2 5 14 3: 1435 

ATG TAG TCT ACT CCC AGG GTT TTA AAT TCC TGC AGC AAT AGA CGC GTG 4 66 9 

Met Tyr Ser Thr Pre Arg Val Leu Asr Ser Cys Ser Asr Arc Arg Val 
1440 1445 2450 



10 TAC AAG GAA ATG CCT AGT ATC GAA TCT GAT GTT TAAAAATCTT CCATTAATGT 4 72 2 

Tyr Lys Glu Met Pro Ser lie Glu Ser Asp Val 

1455 1460 146 

TTTATCTATA GGGAAATACA CGTAATGGCC AATGTTCTGG AGGGTAAATG TTGGATGTCC 47 82 



15 



AATAGTGCCC TGCTAAGAGG AAGGAG 4 80 8 



(2) INFORMATION FOR SEQ ID NO : 1 1 : 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1464 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

25 

(li) MOLECULE TYPE: protein 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO : 1 1 : 

30 Met Gly Arg Val Gly Tyr Trp Thr Leu Leu Val Leu Pro Ala Leu Leu 

c i n 2. 5 

Val Trp Arg Gly Pro Ala Pro Ser Ala Ala Ala Glu lys Gly Pro Pro 

21 25 31 

35 
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Ala Le. Asr. lie Ala Val Me: Leu Gly His Ser His Asp Val . hr 



Arg Glu Leu Arg Thr leu Trp Gly Pro Glu Gin Ala Ala Gly Leu Pro 



Leu Asp Val Asn Val Val Ala Leu Leu Met Asr. Arg Thr Asp Pro Lys 

65 70 75 80 

10 Ser Leu lie Thr His Val Cys Asp Leu Met Ser Gly Ala Arg He His 

85 90 95 

Gly Leu Val Phe Gly Asp Asp Thr Asp Gin Glu Ala Val Ala Gin Met 

100 105 110 

15 

Leu Asp Phe He Ser Ser His Thr Phe Val Pro He Leu Gly He His 

115 12 0 125 

Gly Gly Ala Ser Met He Met Ala Asp Lys Asp Pro Thr Ser Thr Phe 
20 130 135 140 

Phe Gin Phe Gly Ala Ser He Gin Gin Gin Ala Thr Val Met Leu Lys 
145 150 155 160 

25 He Met Gin Asp Tyr Asp Trp His Val Phe Ser Leu Val Thr Thr He 

165 170 175 

Phe Pro Gly Tyr Arg Glu Phe He Ser Phe Val Lys Thr Thr Val Asp 

180 185 190 



30 



Asn Ser Phe Val Gly Trp Asp Met Gin Asn Val He Thr Leu Asp 



35 



Ser Phe Glu Asp Ala Lys Thr Gin Val Gin Leu Lys Lys He His Ser 
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Ser 7 a 1 He Leu Leu Tyr Cys Ser Lys Asp Glu Ala Val Leu lie Leu 

5 Ser Glu Ala Arg Ser Leu Gly Leu Thr Gly Tyr Asp ?he Phe Trp lie 

245 25C 255 

Val Pro Ser Leu Val Ser Gly As- Thr Glu Leu He Pro Lys Glu Phe 

260 265 270 

10 

Pro Ser Gly Leu He Ser Val Ser Tyr Asp Asp Trp Asp Tyr Ser Leu 
275 280 285 

Glu Ala Arg Val Arg Asp Gly He Gly He Leu Thr Thr Ala Ala Ser 
15 290 295 300 

Ser Met Leu Glu Lys Phe Ser Tyr He Pro Glu Ala Lys Ala Ser Cys 

305 310 315 320 

20 Tyr Gly Gin Met Glu Arg Pro Glu Val Pro Met His Thr Leu His Pro 

325 330 335 

Phe Met Val Asn Val Thr Trp Asp Gly Lys Asp Leu Ser Phe Thr Glu 
340 345 350 

25 

Glu Gly Tyr Gin Val His Pro Arg Leu Val Val He Val Leu Asn Lys 
355 360 365 

Asp Arg Glu Trp Glu Lys Val Gly Lys Trp Glu Asn His Thr Leu Ser 
30 37 0 375 380 

Leu Arg His Ala Val Trp Pro Arg Tyr Lys Ser Phe Ser Asp Cys Glu 

355 390 395 400 

35 Pro Asp Asp Asn His Leu Ser He Val Thr Leu Glu Glu Ala Pro Pre 
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Val He Val Glu Asp He Asp Pro Leu Thr Glu Thr Cys Val Arc Asr. 
42C 425 430 

5 

Thr Val Pro Cys Arg Lys Phe Val Lys He Asr. Asr Ser Thr Asr. Glu 
435 44: 445 

Gly Met Asn Val Lys Lys Cys Cys Lys Gly Phe Cys He Asp He Leu 
10 450 455 460 

Lys Lys Leu Ser Arg Thr Val Lys Phe Thr Tyr Asp Leu Tyr Leu Val 
465 470 475 480 

15 Thr Asn Gly Lys His Gly Lys Lys Val Asn Asn Val Trp Asn Gly Met 
485 490 495 

He Gly Glu Val Val Tyr Gin Arg Ala Val Met Ala Val Gly Ser Leu 
500 50 5 510 

20 

Thr He Asn Glu Glu Arg Ser Glu Val Val Asp Phe Ser Val Pro Phe 
515 520 525 

Val Glu Thr Gly He Ser Val Met Val Ser Arg Ser Asn Gly Thr Val 
25 530 535 540 

Ser Pro Ser Ala Phe Leu Glu Pro Phe Ser Ala Ser Val Trp Val Met 
545 550 555 560 

30 Met Phe Val Met Leu Leu He Val Ser Ala He Ala Val Trp Val Leu 

56 5 57: 575 

Asp Tyr Ser Ser Pro Val Gly Tyr Asn Arg Asn Leu Ala Lys Gly Lys 

5B0 585 590 

35 
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Ala Pro His 

Trp Gly Leu 
5 61C 

Thr Thr Ser 

625 

10 Phe Leu Ala 

Glu Phe Val 

15 

Pro His Asp 

675 

Ser Thr Glu 
20 690 

Met Thr Lys 

705 

25 Lys Thr Gly 

Tyr Lys Ala 

30 

Gly Tyr lie 

755 

Ser Pro Trp 

35 -t: 



Gly Pro Ser Phe 

Val Phe Asn Asn 
615 

Lys lie Me: Val 

630 

Ser Tyr Thr Ala 
645 

Asp Gin Val Thr 

660 

Tyr Ser Pro Pro 

Arg Asn lie Arg 
695 

Phe Asn Gin Lys 
710 

Lys Leu Asp Ala 
725 

Gly Arg Asp Glu 
74 0 

Phe Ala Thr Thr 

Lys Arc Gin He 



rhr lie Gly Lys A 1 a 

Ser Val Pro Val Gin 

620 

Ser Val Trp Ala Phe 
635 

Asn Leu Ala Ala Phe 

650 

Gly Leu Ser Asp Lys 
665 

Phe Arg Phe Gly Thr 
68C 

Asn Asn Tyr Pro Tyr 

700 

Gly Val Glu Asp Ala 
715 

Phe He Tyr Asp Ala 

730 

Gly Cys Lys Leu Val 
745 

Gly Tyr Gly lie Ala 

761 

Asp Leu Ala Leu Leu 



He Trp Leu Leu 

Asn Pro Lys Gly 

Phe Ala Val He 
64: 

Met He Gin Glu 
655 

Lys Phe Gin Arg 

670 

Val Pro Asn Gly 
685 

Met His Gin Tyr 

Leu Val Ser Leu 

72 0 

Ala Val Leu Asn 

735 

Thr He Gly Ser 

750 

Leu Gin Lys Gly 

Gin Phe Val Gly 
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Asp Gly Glu Me: Glu Glu Leu Glu Thr Leu Trp Leu Lhr Gly He Cys 



5 His Asn GIu Lys Asr. Glu Val Met Ser Ser Gin Leu Asp He Asp Asn 

S 10 5 S 1 0 s 1 5 

Me: Ala Gly Val Phe Tyr Met Leu Ala Ala Ala Met Ala Leu Ser Leu 

82 0 82 5 S3! 

10 

He Thr Phe He Trp Glu His Leu Phe Tyr Trp Lys Leu Arg Phe Cys 
335 840 845 

Phe Thr Gly Val Cys Ser Asp Arg Pro Gly Leu Leu Phe Ser He Ser 
15 850 855 860 

Arg Gly He Tyr Ser Cys He His Gly Val His lie Glu Glu Lys Lys 

865 370 375 880 

20 Lys Ser Pro Asp Phe Asn Leu Thr Gly Ser Gin Ser Asn Met Leu Lys 

885 890 895 

Leu Leu Arg Ser Ala Lys Asn He Ser Ser Met Ser Asn Met Asn Ser 

900 905 910 

25 

Ser Arg Met Asp Ser Pro Lys Arg Ala Ala Asp Phe He Gin Arg Gly 

915 920 925 

Ser Leu He Met Asp Met Val Ser Asp Lys Gly Asn Leu Met Tyr Ser 
30 930 935 940 

Asp Asn Arg Ser Phe Gin Gly Lys Glu Ser He Phe Gly Asp Asn Met 
35 Asn Glu Leu Gin Thr Phe Val Ala Asn Arg Gin Lys Asp Asn Leu Asn 
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Asr. 7yr Va I P'r.e Glr. Gly Gin His Pro Leu Thr Leu Asr. Glu Ser Asn 

93: 9S5 ??: 

5 

Pro Asr. Thr Val Glu Val Ala Val Ser Thr Glu Ser Lys Ala Asr: Ser 



Arg Pro Arg Glr Leu Trp Lys Lys Ser Val Asp Ser lie Arg Glr Asp 

10 1C1C 1015 1020 

Ser Leu Ser Gin Asn Pro Val Ser Gin Arg Asp Glu Ala Thr Ala Glu 
1025 1030 1035 1040 

15 Asn Arg Thr His Ser Leu Lys Ser Pro Arg Tyr Leu Pro Glu Glu Met 
1045 1C50 1055 

Ala His Ser Asp lie Ser Glu Thr Ser Asn Arg Ala Thr Cys His Arg 
1060 1065 1070 

20 

Glu Pro Asp Asn Ser Lys Asn His Lys Thr Lys Asp Asn Phe Lys Arg 

1075 10 80 1085 

Ser Val Ala Ser Lys Tyr Pro Lys Asp Cys Ser Glu Val Glu Arg Thr 

25 1090 1095 1100 

Tyr Leu Lys Thr Lys Ser Ser Ser Pro Arg Asp Lys He Tyr Thr He 
1105 1110 1115 112 0 

30 Asp Gly Glu Lys Glu Pro Gly Phe His Leu Asp Pro Pro Gin Phe Val 

1125 1130 1135 

Glu Asn Val Thr Leu Pro Glu Asn Val Asp Phe Pro Asp Pro Tyr Glr. 

35 
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Asp Pre Ser Glu Asr. Phe Arc lys Gly Asp Ser Ihr leu Pro Met Asr 



Arg Asr Pro leu His Asr Glu Glu Gly leu Ser Asr Asr Asp Gin Tyr 
5 117" 1175 us: 

Lys leu Tyr Ser Lys His Pre Thr leu lys Asp lys Gly Ser Pro His 
1165 1190 1195 12C0 

10 Ser Glu Thr Ser Glu Arg Tyr Arg Gin Asn Ser Thr His Cys Arg Ser 

1205 1210 1215 

Cys Leu Ser Asn Met Pro Thr Tyr Ser Gly His Phe Thr Met Arg Ser 
1220 1225 1230 

15 

Pro Phe Lys Cys Asp Ala Cys Leu Arg Met Gly Asn Leu Tyr Asp lie 
1235 1240 1245 

Asp Glu Asp Gin Met leu Gin Glu Thr Gly Asn Pro Ala Thr Gly Glu 
20 1250 1255 1260 

Gin Val Tyr Gin Gin Asp Trp Ala Gin Asn Asn Ala leu Gin Leu Gin 
1265 1270 1275 1280 

25 Lys Asn Lys Leu Arg He Ser Arg Gin His Ser Tyr Asp Asn He Val 

1285 1290 1295 

Asp Lys Pro Arg Glu Leu Asp Leu Ser Arg Pro Ser Arg Ser lie Ser 
13 0 0 1305 1310 



30 



leu lys Asp Arg Glu Arg leu leu Glu Gly Asn Phe Tyr Gly Ser leu 
1315 13 2 0 13 2 5 



Phe Ser Val Pro Ser Ser lys leu Ser Gly lys lys Ser Ser leu Phe 
35 1330 133 5 134: 
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Pro Glr. Gly Leu Glu Asp Ser Lys Arc Ser Lys Ser Leu Leu 



His Thr Ser Asp Asn Pro Phe Leu His Ser His Arg Asp Asp Gin Arc 
1365 1370 1375 

Leu Val He Gly Arg Cys Pro Ser Asp Pro Tyr Lys His Ser Leu Pro 

13 6 0 1 3 S 5 13 91 

Ser Gin Ala Val Asn Asp Ser Tyr Leu Arg Ser Ser Leu Arg Ser Thr 
13 9 5 14 0 0 14 0 5 



Ala Ser Tyr Cys Ser Arg Asp Ser Arg Gly His Asn Asp Val Tyr lie 
15 1410 1415 142 0 

Ser Glu His Val Met Pro Tyr Ala Ala Asn Lys Asn Asn Met Tyr Ser 
1425 1430 1435 14 4 C 

20 Thr Pro Arg Val Leu Asn Ser Cys Ser Asn Arg Arg Val Tyr Lys Glu 

1445 1450 1455 

Met Pro Ser lie Glu Ser Asp Val 
1460 



25 



(2) INFORMATION FOR SEQ ID NO: 12: 

( l ) SEQUENCE CHARACTERISTICS : 
30 (A) LENGTH: 74 base pairs 

{3) TYPE: nucleic acici 

( C ) ST RAND E ONE SS : bo z h 

(D) TOPOLOGY: both 



35 (ID MGLECL'LE TYPE: cD:;A 
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ixi) SEQUENCE RESCRIPT: OX : SEQ 13 NO: 12: 
CGAGGGAGGC GGCCGGCGCG GACTCTCTTC GCGGGCGCAG CGCCCCTTCC CCCTCGGACC 6 

5 

CTCCGGTGGA CATG 



10 



(2) INFORMATION FOR SEQ ID NO : 1 3 : 



74 



tl) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5538 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 
15 <D) TOPOLOGY: both 

(11) MOLECULE TYPE: cDNA 

(ix) FEATURE: 
20 (A) NAME / KEY : CDS 

(B) LOCATION: 210.. 4664 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 3 : 

25 TTGAATT TGC ATCTCTTCAA GACACAAGAT TAAAACAAAA TTTACGCTAA ATTGGATTTT 6 0 

AAATTATCTT CCGTTCATTT ATCCTTCGTC TTTCTTATGT GGATATGCAA GCGAGAAGAA 120 

GGGACTGGAC ATTCCCAACA TGCTCACTCC CTTAATCTGT CCGTCTAGAG GTTTGG OTTC ISC 

30 

TACAAACCAA GGGAGTCGAC GAGTTGAAG ATG AAG CCC AGA GCG GAG TGC TGT 23 3 

Mel Lys Pro Arg Ala Glu Cys C/s 



35 



7CT CC0 AAG TTC T3G TTG 3TG TTG GCC GTC C7G GCC GTG TCA 0GC AGC 
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. rp 



A_a Va. Le'j 



Ser Gly Ser 



AGA GCT GGT TCT CAG AAG AGC CCC CCC AGC ATT GGC ATT GCT GTC ATC 3 29 

5 Arg Ala Arg Ser Gin Lys Ser Pro Pro Ser He Gly He Ala 7a 1 He 

25 31 35 40 

CTC GTG GGC ACT TCC GAC GAG GTG GCC ATC AAG GAT GCC CAC GAG AAA 3 77 

Leu Val Gly Thr Ser Asp Glu Val Ala He Lys Asp Ala His Glu Lys 

10 45 5D 55 

GAT GAT TTC CAC CAT CTC TCC GTG GTA CCC CGG GTG GAA CTG GTA GCC 42 5 

Asp Asp Phe His His Leu Ser Val Val Pro Arg Val Glu Leu Val Ala 

60 65 70 

15 

ATG AAT GAG ACC GAC CCA AAG AGC ATC ATC ACC CGC ATC TGT GAT CTC 473 

Met Asn Glu Thr Asp Pro Lys Ser He He Thr Arg He Cys Asp Leu 

75 33 85 

20 ATG TCT GAC CGG AAG ATC CAG GGG GTG GTG TTT GCT GAT GAC AC A GAC 521 

Met Ser Asp Arg Lys He Gin Gly Val Val Phe Ala Asp Asp Thr Asp 

9 0 9 5 10 0 

CAG GAA GCC ATC GCC CAG ATC CTC GAT TTC ATT TCA GCA CAG ACT CTC 569 

25 Gin Glu Ala He Ala Gin He Leu Asp Phe He Ser Ala Gin Thr Leu 

105 113 H5 120 

ACC CCG ATC CTG GGC ATC CAC GGG GGC TCC TCT ATG ATA ATG GCA GAT 617 

Thr Pro He Leu Gly He His Gly Gly Ser Ser Met He Met Ala Asp 

30 125 130 135 

AAG GAT 3AA TCI TC0 AT 3 TTC TTC CAG TTT GGC CCA TCA ATT 3AA CAG 66 5 

Lys Asp Glu Ser Ser Met Phe Phe 3 In Phe Gly Pro Ser He 31 u Gin 

14C 145 151 

35 
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Ser Val Met Leu Asn He Me: Glu Glu - yr Asp 7rp . yr 



5 TTT TCT ATC GTC ACC ACC TAT TTC CCT GGC TAT GAG GAC TTT GTA AAC 761 

?he Ser He Val Thr Thr Tyr Pre Pro Gly Tyr Glr Asp Phe Val Asn 

it: 175 is: 

AAG ATC CGG AGC ACT ATT GAG AAT AGG TTT GTG GGC TGG GAG GTA GAG 8 09 

10 Lys He Arg Ser Thr He Glu Asr Ser Phe Val Gly Trp Glu Leu Glu 

135 190 19 5 2 0 C 

GAG GTC CTC CTA CTG GAC ATG TCC CTG GAC GAT GGA GAT TCT AAG ATC 857 

Glu Val Leu Leu Leu Asp Met Ser Leu Asp Asp Gly Asp Ser Lys He 

15 205 210 215 

GAG AAT CAG CTC AAG AAA CTT CAA AGC CCC ATC ATT CTT CTT TAC TGT 905 

Gin Asn Gin Leu Lys Lys Leu Gin Ser Pro He He Leu Leu Tyr Cys 

220 225 230 

20 

ACC AAG GAA GAA GCC ACC TAC ATC TTT GAA GTG GCC AAC TCA GTA GGG 953 

Thr Lys Gl a Glu Ala Thr Tyr He Phe Glu Val Ala Asn Ser Val Gly 

235 240 245 

25 CTG ACT GGC TAT GGC TAC ACG TGG ATC GTG CCC AGT CTG GTG GCA GGG 1001 

Leu Thr Gly Tyr Gly Tyr Thr Trp He Val PrD Ser Leu Val Ala Gly 

2 5 0 2 5 5 2 6 1 

GAT ACA GAC AC A GTG CCT GCG GAG TTC CCC ACT GGG CTC ATC TCT GTA 104 9 

30 Asp Thr Asp Thr Val Pro Ala Glu Phe Pr:> Thr Gly Leu He Ser Val 

265 270 275 2 80 

TCA TAT GAT 3AA TG3 GAC TAT GGC CTC CCZ CC 7 AG A GTG AG A GAT GGA H9~ 

Ser Tyr Asp Glu Trp Asp Tyr Gly Leu Pro Pro Arc Val Arg Asp Gly 



35 
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ATT GCC ATA ATC ACC ACT GCT GCT TCT GAC ATG CTG TCT GAG CAT AG C 114 5 

lie Ala lie lie Thr Thr Ala Ala Ser Asp Met leu Ser Glu His Ser 

3CO 3C5 31C 

5 

TTC ATC CCT GAG CCC AAA AGC AGT TGT TAC AAC ACC CAC GAG AAG AGA 1193 

Phe He Pro Glu Pro Lys Ser Ser Cys Tyr Asn Thr His Glu Lys Arg 

315 320 325 

10 ATC TAC CAG TCC AAT ATG CTA AAT AGG TAT CTG AT 1 AAT GTC ACT TTT 1241 

lie Tyr Gin Ser Asn Met Leu Asn Arg Tyr Leu lie Asn Val Thr Phe 
330 335 34D 

GAG GGG AGG AAT TTG TCC TTC AGT GAA GAT GGC TAC CAG ATG CAC CCG 12 89 

15 Glu Gly Arg Asn Leu Ser Phe Ser Glu Asp Gly Tyr Gin Met His Pro 
345 350 355 350 

AAA CTG GTG ATA ATT CTT CTG AAC AAG GAG AGG AAG TGG GAA AGG GTG 13 3 7 

Lys Leu Val He He Leu Leu Asn Lys Glu Arg Lys Trp Glu Arg Val 

20 365 370 375 

GGG AAG TGG AAA GAC AAG TCC CTG CAG ATG AAG TAC TAT GTG TGG CCC 1385 

Gly Lys Trp Lys Asp Lys Ser Leu Gin Met Lys Tyr Tyr Val Trp Pro 

380 335 390 

25 

CGA ATG TGT CCA GAG ACT GAA GAG CAG GAG GAT GAC CAT CTG AGC ATT 143 3 

Arg Met Cys Pro Glu Thr Glu Glu Gin Glu Asp Asp His Leu Ser He 

3 9 5 4 0 0 4 0 5 

30 GTG ACC CTG GAG GAG GCA CCA TTT GTC ATT GTG GAA AGT GTG GAC COT 14 81 

Val Thr Leu Glu Glu Ala Pro Phe Val He Val Glu Ser Val Asp Pro 
4 10 4 15 4 2 0 

CTG AGT GGA ACC TGC ATG AGG AAC ALA GT 0 CCC TGI CAA AAA CGO ATA 1529 

35 Leu Ser Gly Thr Cys Met Arg Asn Thr Val Pre Cys Gin Lys Arg lie 
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10 



GTC ACT GAG AAT AAA AC A GAC GAG GAG CCG GGT 7 AC ATC AAA AAA TGC 157 7 

Val Thr Glu Asr. Lys Thr Asp Glu Glu Pro Gly Tyr He Lys lys Cys 

445 45: 455 

TGC AAG GGG TTC TGT ATT GAC ATC CTT AAG AAA ATT TCT AAA TCT GTG 162 5 

Cys Lys Gly Phe Cys He Asp He Leu Lys Lys He Ser Lys Ser Val 

46C 465 47C 

AAG TTC ACC TAT GAC CTT TAC CTG GTT ACC AAT GGC AAG CAT GGG AAG 167 3 

Lys Phe Thr Tyr Asp Leu Tyr Leu Val Thr Asr. Gly Lys His Gly Lys 

475 480 435 



15 AAA ATC AAT GGA ACC TGG AAT GGT ATG ATT GGA GAG GTG GTC ATG AAG 1721 

Lys He Asn Gly Thr Trp Asn Gly Met He Gly Glu Val Val Met Lys 

4 9 D 4 95 500 

AGG GCC TAC ATG GCA GTG GGC TCA CTC ACC ATC AAT GAG GAA CGA TCG 176 9 

20 Arg Ala Tyr Met Ala Val Gly Ser Leu Thr He Asn Glu Glu Arg Ser 

505 510 515 520 

GAG GTG GTC GAC TTC TCT GTG CCC TTC ATA GAG AC A GGC ATC AGT GTC 1817 

Glu VaL Val Asp Phe Ser Val Pro Phe He Glu Thr Gly He Ser Val 

25 525 53: 535 

ATG GTG TCA CGC AGO AAT GGG ACT GTC TCA CCT TCT GCC TTC TTA GAG 186 5 

Met Val Ser Arg Ser Asn Gly Thr Val Ser Pro Ser Ala Phe Leu Glu 
540 545 550 



30 



CCA TTC AGC GCT GAC GTA TGG GTG ATG ATG TTT GTG ATG CTG CTC ATC 1913 
?r: Phe Ser Ala Asp Val Trp Val Met Me: Phe Val Met leu leu He 



35 GTC TCA GCC GTG GCT GTC TTT GTC TTT GAG TAC TTC AGC CCT 3.G 
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Val Ser Ala Val Ala Val ?he Val Phe Glu Tyr Phe Ser Pro Val Gly 

TAT AAC AGG TGC CTC GCT GAT GGC AGA GAG CCT GGT GGA CCC TCT TTC 2 0 09 

5 Tyr Asn Arg Cys Leu Ala Asp Gly Arg Glu Pro Gly Gly Pro Ser Phe 

595 591 595 6C1 

ACC ATC GGC AAA GCT ATT TGG TTG CTC TGG GGT CTG GTG TTT AAC AAC 2 0 57 

Thr He Gly Lys Ala He Trp Leu Leu Trp Gly Leu Val Phe Asn Asn 
10 605 613 615 

TCC GTA CCT GTG CAG AAC CCA AAG GGG ACC ACC TCC AAG ATC ATG GTG 2105 

Ser Val Pro Val Gin Asn Pro Lys Gly Thr Thr Ser Lys lie Met Val 
620 625 630 

15 

TCA GTG TGG GCC TTC TTT GCT GTC ATC TTC CTG GCC AGC TAG ACT GCC 215 3 

Ser Val Trp Ala Phe Phe Ala Val He Phe Leu Ala Ser Tyr Thr Ala 

635 640 645 

20 AAC TTA GCT GCC TTC ATG ATC CAA GAG GAA TAT GTG GAC CAG GTT TCT 22 01 

Asn Leu Ala Ala Phe Met He Gin Glu Glu Tyr Val Asp Gin Val Ser 

650 655 66D 

GGC CTG AGC GAC AAA AAG TTC CAG AGA CCT AAT GAC TTC TCA CCC CCT 2 24 9 

25 Gly Leu Ser Asp Lys Lys Phe Gin Arg Pro Asn Asp Phe Ser Pro Pro 

665 670 675 680 

TTC CGC TTT GGG ACC GTG CCC AAC GGC AGC ACA GAG AGA AAT ATT CGC 2 2 97 

Phe Arg Phe Gly Thr Val Pro Asn Gly Ser Thr Gl i Arg Asn lie Arg 
30 685 690 695 

AAT AA0 TAT GCA GAA. ATG CAT GCC TAC ATG GGA AAG TTC AAC CAG AGG 2 3 4 5 

Asn Asn Tyr Ala Glu Met His Ala Tyr Met Gly Lys Phe Asn Gin Arg 



35 
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Gly Val Asp Asp Ala Leu leu Ser leu lys Thr Gly lys leu Asp Ala 



5 TTC ATC TAT GAT GCA GCA GTG CTG AAC TAT ATG GCA GGC AGA GAT GAA 24 41 

?he lie Tyr Asp Ala Ala Val Leu Asn Tyr Met Ala Gly Arg Asp Glu 

73 D 73 5 740 

GGC TGC AAG CTG GTG ACC ATT GGC ACT GGG AAG GTC TTT GCT TCC ACT 24 6 9 

10 Gly Cys Lys Leu Val Thr He Gly Ser Gly Lys Val Phe Ala Ser Thr 

745 750 755 750 

GGC TAT GGC ATT GCC ATC CAA AAA GAT TCT GGG TGG AAG CGC CAG GTG 2 5 37 

Gly Tyr Gly lie Ala He Gin Lys Asp Ser Gly Trp Lys Arg Gin Val 

15 765 77C 775 

GAC CTT GCT ATC CTG CAG CTC TTT GGA GAT GGG GAG ATG GAA GAA CTG 2 5 85 

Asp Leu Ala He Leu Gin leu Phe Gly Asp Gly Glu Met Glu Glu leu 

78 0 735 790 

20 

GAA GCT CTC TGG CTC ACT GGC ATT TGT CAC AAT GAG AAG AAT GAG GTC 26 3 3 

Glu Ala Leu Trp Leu Thr Gly He Cys His Asn Glu Lys Asn Glu Val 

795 80 0 305 

25 ATG AGC AGC CAG CTG GAC ATT GAC AAC ATG GCA GGG GTC TTC TAC ATG 2 6 81 

Met Ser Ser Gin Leu Asp He Asp Asn Met Ala Gly Val Phe Tyr Met 

810 315 320 

TTG GGG GCG GCC ATG GCT CTO AGC CTC ATC ACC TTC ATC TGC GAA CAC 2 72 9 

30 Leu Gly Ala Ala Met Ala Leu Ser Leu He Thr Phe He Cys Glu His 

325 830 335 340 

CTT TTC TAT TGG CAG TTC CGA CAT TGC TTT ATG GGT GTC TGT TCT GGC 2 "7 

Leu Phe Tyr Trp Gin Phe Arg His Cys Phe Met Gly Val Cys Ser Gly 

35 S45 65C S 5 E 
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AAG CCT GGC ATG GTC TTC TCC ATC AGC AG A GGT ATC TAG AGC TGC ATG 2 r H 

Lys Pro Gly Me: Va 1 ?he 3er He Ser Arg Gly He Tyr Ser Cys He 

861 365 370 

5 

CAT GGG GTG GCG ATC GAG GAG CGC CAG TCT GTA ATG A^C TCC C OC ACC 2 57 3 

His GLy Vai Ala He Glu Glu Arg Gin Ser Va 1 Me: Asn Ser Pro Thr 

37 5 S3! 9 3 5 

10 GCA ACC ATG AAC AAC ACA CAC TCC AAC ATC CTG CGO CTG CTG CGC ACG 2 921 

Ala Thr Me: Asn Asn Thr His Ser Asn lie Leu Arg Leu Leu Arg Thr 

39 3 3 95 90 j 

GCC AAG .AAC ATG GCT AAC CTG TCT GGT GTG AAT GGC TO A CCG CAG AGC 2969 

15 Ala Lys Asn Me: Ala Asn Leu Ser Gly Val Asn Gly Ser Pro Gin Ser 

905 91D 915 92D 

GCC CTG GAC TTC ATC CGA CGG GAG TCA TCC GTC TAT GAC ATC TCA GAG 3 017 

Ala Leu Asp Phe He Arg Arg Glu Ser Ser Val Tyr Asp lie Ser GLu 
20 925 930 935 

CAC CGC CGC AGC TTC ACG CAT TCT GAC TGC AAA TCC TAC AAC AAC CCG 3 065 

His Arg Arg Ser Phe Thr His Ser Asp Cys Lys Ser Tyr Asn Asn Pro 
940 945 950 

25 

CCC TGT GAG GAG AAC CTC TTC AGT GAC TAC ATC AGT GAG GTA GAG AGA 3113 

PrD Cys Glu Glu Asn Le j Phe Ser Asp Tyr He Ser Glu Val Glu Arg 
955 960 965 

30 ACG TTC GGG AA0 CTG CAG CTG AAG GAC AGC .AAC GTG TAC CAA GAT CAC 3161 

Thr Phe Gly Asn Leu Gin Leu Lys Asp Ser Asn Val Tyr Gin Asp His 

97: 975 93: 

ta: :a: cat ca: cac ogg ccc cat agt att ggc agt gcc agc rcc atc 32:9 

35 Tyr His His His His Arg Pro His Ser lie Gly Ser Ala Ser Ser He 
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GAT GGG CTC TAG GAG TGT GAG AAG CCA CCC 7TC ACC ACC GAG TCC AGG 
Asp Gly Leu Tyr Asp Gys Asp A sr. Pro Pro Phe Thr Thr Gin Ser Arg 



TCC ATG AGC AAG AAG CCC CTG GAG ATG GGG CTG CCC TCC TCC AAG GAG 3 3 J 5 

Ser lie Ser Gys Lys Pro Leu Asp He Gly Leu Pro Ser Ser Lys His 
1020 1025 1030 

AGG CAG CTG AGT GAG GTG TAG GGG AAA TTG TCG TTG AAG AGC GAG GGG 33 53 

Ser Gin Leu Ser Asp Leu Tyr Gly Lys Phe Ser Phe Lys Ser Asp Arg 

1035 1040 1045 



15 TAG AGT GGG CAC GAG GAC TTG ATG CGC TCG GAT GTC TCT GAC ATC TGA 34 01 

Tyr Ser Gly His Asp Asp Leu He Arg Ser Asp Val Ser Asp He Ser 

1050 1055 1050 

AGG CAG ACC GTG ACG TAT GGG AAG ATG GAG GGG AAT GCC GCC AAG AGG 344 9 

20 Thr His Thr Val Thr Tyr Gly Asn He Glu Gly Asn Ala Ala Lys Arg 

1D65 1070 1375 1330 

CGT AAG CAG CAA TAT AAG GAG AGC CTG AAG AAG CGG CCT GCC TCG GGG 3 4 97 

Arg Lys Gin Gin Tyr Lys Asp Ser Leu Lys Lys Arg Pro Ala Ser Ala 

25 10r.5 1090 1095 

AAG TCC CGG AGG GAG TTT GAG GAG ATG GAG CTG GCC TAC CGT CGG GGA 3 54 5 

Lys Ser Arg Arg Glu Phe Asp Glu He Glu Lei Ala Tyr Arg Arg Arg 

1100 1135 lilC 



30 



CGG GCC CGG TCG CCT GAG CAG AAG CGG TAC TTG AGG GAC AAG GAA GGG 3 5 93 

PrD Pro Arg Ser Pro Asp His Lys Arg Tyr Phe Arg Asp Lys Glu Gl/ 



35 GTA CGG GAG TT0 TA0 GTG GA0 CAG TTG CGA ACA AAG GAG AAC TGA 2CC 3 64 1 
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Leu Arg Asp Phe Tyr Lej Asp Glr. ?he 



CAC TGG GAG CAC GTA GAC CTG ACC GAC ATC TAC AAG GAG CGG AGT GAT 
His Trp Glu His Val Asp Leu Thr Asp He Tyr Lys Glu Arg Ser Asp 



GAG TTT AAG CGG GAC TCC ATC AGO GGA GGA GGG CCC TGT ACC AAC AGG 3 737 

Asp ?he Lys Arg Asp Ser He Ser Gly Gly Gly Pro Cys Thr Asn Arg 

10 1165 1170 1175 

TCT CAC ATC AAG CAC GGG ACG GGC GAC AAA CAC GGC GTG GTC AGC GGG 3785 

Ser His He Lys His Gly Thr Gly Asp Lys His Gly Val Val Ser Gly 
1130 1135 1150 

15 

GTA CCT GCA CCT TGG GAG AAG AAC CTG ACC AAC GTG GAG TGG GAG GAC 3 83 3 

Val Pro Ala Pro Trp Glu Lys Asn Leu Thr Asn Val Glu Trp Glu Asp 

1195 12 0 0 12 0 5 

20 CGG TCC GGG GGC AAC TTC TGC GGC AGC TGT CCC TCC AAG CTG CAC AAC 3 8 81 

Arg Ser Gly Gly Asn Phe Cys Arg Ser Cys Pro Ser Lys Leu His Asn 
1210 1215 1220 

TAC TCC ACG ACG GTG ACG GGT CAC AAC TCG GGC AGG GAG GCG TGC ATC 3 92 9 

25 Tyr Ser Thr Thr Val Thr Gly Gin Asn Ser Gly Arg Gin Ala Cys He 

1225 1230 1235 1240 

CGG TGT GAG GCT TGC AAG AAA GCA GGC AAC CTG TAT GAC ATC AGT GAG 3 9 77 

Ar? Cys Glu Ala Cys Lys Lys Ala Gly Asn Leu Tyr Asp He Ser Glu 

30 1245 1250 1255 

GAC AA0 TC: CT3 DAG GAA CTG GAT. CAS CC 2 GCT GCC CCA GTG GCG GTG 4Z 2 5 

Asp Asn Ser Leu 31n Glu Leu Asp Gin ?rj Ala Ala PrD Val Ala Val 



35 
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ACG TCA AAC GCC TCC ACC ACT AAG TAC CCT CAG AGC CCG 
Thr Ser A sr. Ala Ser Thr Thr Lys Tyr Pro Gin Ser Pro 



5 AAG GCC CAG .AAG AAG AAC CGG AAC AAA CTG CGC CGG CAG CAC TCC TAC 4121 

Lys Ala Gin lys lys A sr. Arg Asn Lys leu Arg Arg Gin His Ser Tyr 

1290 1295 13D0 

GAC ACC TTC GTG GAC CTG CAG AAG GAA GAA GCC GCC CTG GCC CCG CGC 4169 

10 Asp Thr Phe Val Asp Leu Gin Lys Glu Glu Ala Ala Leu Ala Pro Arg 
13D5 1310 1315 1320 

AGO GTA AGC CTG AAA GAC AAG GGC CGA TTC ATG GAT GGG AGC CCC TAC 4217 

Ser Val Ser Leu Lys Asp Lys Gly Arg Phe Met Asp Gly Ser Pro Tyr 
15 1325 1330 1335 

GCC CAC ATG TTT GAG ATG TCA GCT GGC GAG AGC ACC TTT GCC AAC AAC 42 6 5 

Ala His Met Phe Gl a Met Ser Ala Gly Glu Ser Thr Phe Ala Asn Asn 
134 0 1345 1350 

20 

AAG TCC TCA GTG CCC ACT GCC GGA CAT CAC CAC CAC AAC AAC CCC GGC 4313 

Lys Ser Ser Val Pro Thr Ala Gly His His His His Asn Asn Pro Gly 

1355 135 0 13 6 5 

25 GGC GGG TAC ATG CTC AGC AAG TCG CTC TAC CCT GAC CGG GTC ACG CAA 43 61 

Gly Gly Tyr Met Lei Ser Lys Ser Leu Tyr Pro Asp Arg Val Tnr Gin 

1370 1375 1330 

AAC CCT TTC ATC CCC ACT TTT GGG GAC GAC CAG TGC TTG CTC CAT GGC 4 4 0 9 

30 Asn Pro Phe lie Pro Thr Phe Gly Asp Asp Gin Cys Le^ Lei His Gly 

13 35 13 90 13 95 14 0 0 

AG C AAA TCC TA 2 TTC TT0 AGG CAG C0C ACG GTG GC 3 GGG GOG TCG AAA 4 4 1 " 7 

Ser Lys Ser Tyr Phe Phe Arg Gin Pro Thr Val Ala Gly Ala Ser Lys 

35 14- ;<". :4ii 
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GCC AGG CCG GAC TTC CGG GCC CTT GTC ACC AAC AAG CCG GTG GTC TCG 45:5 

Ala Arc Pro Asp Phe Arc Ala leu Yal Thr Asr. Lys Pro Val Yal Ser 

1421 1425 1431 

5 

GCC CTT CAT GGG GCC GTG CCA GCC CGT TTC CAG AAG GAC ATT TGT ATA 45 53 

Ala Leu His Gly Ala Val Pro Ala Arg Phe Gin Lys Asp He Cys lie 

1435 1440 1445 

10 GGG AAC CAG TCC AAC CCC TGT GTG CCT AAC .AAC AC A AAC CCC AGG GCT 4 601 

Gly Asn Gin Ser Asr. Pro Cys Val Pro Asn Asr Thr Asn Pro Arg Ala 
1450 1455 1460 

TTC AAT GGC TCC AGC AAT GGG CAT GTT TAT GAG AAA CTT TCT ACT ATT 4 64 9 

15 Phe Asn Gly Ser Ser Asn Gly His Val Tyr Glu Lys Leu Ser Ser lie 
1465 1470 1475 1480 

GAG TCT GAT GTC TGAGTGAGGG AACAGAGAGG TTAAGGTGGG TACGGGAGGG 4 701 

Glu Ser Asp Val 
20 148 

TAAGGCTGTG GGTCGCGTGA TGCGCATGTC ACGGAGGGTG ACGGGGGTGA ACTTGGTTCC 47 61 



25 



JATTTGCTCC TTTCTTGTTT TAATTTATTT ATGGGATCCT GGAGTTCTGG TTCCTACTGG 4 821 



3GGCAACCCT GGTGACCAGC ACCATCTCTC CTCCTTTTCA CAGTTCTCTC CT1CTTCCCC 4881 



CCGCTGTC AG CCATTCCTGT TCCCATGAGA TGATGCCATG GGCCCTCTCA GCAGGGGAGG 4 941 



30 GTAGAGCGGA GAAAGGAAGG GCTGCATGCG GGCTTCC TCC TGGTGTGG.AA GAGCTCCTTG 5CC1 



\TATCCTCTT TGAGTGAAGC TGGGAGA-ACC AA.AAAGAGGC TA1GTGAGCA CAA-AGGTAGC 5 C 6 1 



7 T T T C 0 C AAA 7TGATCTTT7 C AT T T AGG TG A GG AAG C AAA AGCA.C.A' 



35 
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3CACAC7G C77G7GAAAG GAAAGAGGCT C7GGC7AAA7 7CA7GC7GC1 7 AG AT G AC A - 



CTGTC7AGGA A7CA7G7GCC AAGCAGAGG7 7GGGAGGCCA 777G7G777A 7A7A7AAGCC 



5 CAAAAA7GCT 7GC77CAACG CCA7GAGAC7 CGA7AG7GG7 GG 7 G AAC AG A ACCCAAGGTC 53 21 



A77GG7GGCA GAG7GGA77 2 TTGAACAAAC 7GGAAAG7AC GTTATGATAG TGTCCCCCGG 53 61 



10 



15 



TGCCTTGGGG ACAAGAGCA 3 G7GGA77G7G CGTGCATGTG 7G77CA7GCA CA2TTGCACC 542 1 



CATGTGTAGT CAGGTGCCTC AAGAGAAGGC AACCTTGACT CTTTCGTTGA ATTTGCATCT 54 81 



CTTCAAGACA CAAGATTAAA ACAAAATTTA CGCTAAATTG GATTTTAAAT TATCTTC 5 53 8 



(2} INFORMATION FOR SEQ ID NO: 14: 



(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1484 amino acids 
20 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(n) MOLECULE TYPE: protein 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Lys Pro Arg Ala Glu Cys Cys Ser Pro Lys Phe Trp Leu Val Leu 
15 10 15 

30 Ala Val Leu Ala Val Ser Gly Ser Arg Ala Arg Ser Gin Lys Ser Pro 

2Z 25 30 

Pro Ser lie Gly lie Ala Val He Leu Val 3iy Thr Ser Asp Glu Val 
3 5 4 C 4 5 

35 
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Ala He Lys Asp Ala His Glu Lys Asp Asp Phe His His Leu Ser Val 



Val Pro Arg Val Glu Leu Val Ala Met A sr. Glu Thr Asp Pro Lys Ser 
5 65 7C 75 SO 

lie He Thr Arg He Cys Asp Leu Met Ser Asp Arg Lys He Gin Gly 

8 5 9 0 9 5 

10 Val Val Phe Ala Asp Asp Thr Asp Gin Glu Ala He Ala Gin lie Leu 

10C 105 110 

Asp Phe He Ser Ala Gin Thr Leu Thr Pro He Leu Gly He His Gly 
115 120 125 

15 

Gly Ser Ser Met He Met Ala Asp Lys Asp Glu Ser Ser Met Phe Phe 

130 135 140 

Gin Phe Gly Pro Ser He Glu Gin Gin Ala Ser Val Met Leu Asn He 
20 145 150 155 160 

Met Glu Glu Tyr Asp Trp Tyr He Phe Ser He Val Thr Thr Tyr Phe 
165 170 175 

25 Pro Gly Tyr Gin Asp Phe Val Asn Lys He Arg Ser Thr He Glu Asn 

180 185 190 

Ser Phe Val Gly Trp Glu Leu Glu Glu Val Leu Leu Leu Asp Met Ser 

195 200 205 



30 



Leu Asp Asp Gly Asp Ser Lys He Gin Asn Gin Leu Lys Lys Leu Gin 

2 1 1 2 2 5 2 2 0 



Ser Pro He He Leu Leu Tyr Cys Thr Lys Glu Glu Ala Thr ?yr He 
35 225 231 235 241 
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?he Glu Val Ala Asn Ser Val Gly Leu Thr Gly ?yr Giy Tyr rhr Trp 



5 He Val Pro Ser Leu Val Ala Gly Asp Thr Asp Thr Val Pro Ala Glu 

261 265 271 

Phe Pro Thr Gly Leu lie Ser Val Ser Tyr Asp Glu Trp Asp Tyr Gly 

27 5 2 80 285 

10 

Leu Pro Pro Arg Val Arg Asp Gly lie Ala lie lie Thr Thr Ala Ala 

290 295 300 

Ser Asp Met Leu Ser Glu His Ser Phe lie Pro Glu Pro Lys Ser Ser 
15 305 310 315 320 

Cys Tyr Asn Thr His Glu Lys Arg lie Tyr Gin Ser Asn Met Leu Asn 

325 330 335 

20 Arg Tyr Leu lie Asn Val Thr Phe Glu Gly Arg Asn Leu Ser Phe Ser 
3 40 345 35 0 

Glu Asp Gly Tyr Gin Met His Pro Lys Leu Val lie lie Leu Leu Asn 
355 360 365 

25 

Lys Glu Arg Lys Trp Glu Arg Val Gly Lys Trp Lys Asp Lys Ser Leu 

370 375 380 

Gin Met Lys Tyr Tyr Val Trp Pro Arg Met Cys Pro Glu Thr Glu Glu 
30 385 390 395 400 

Gin Glu Asp Asp His Leu Ser He Val Thr Leu Glu Glu Ala Pro Phe 
415 4i: 415 



35 Val He Val Glu Ser Val Asp Pro Leu Ser Gly Thr Cys Met Arg Asn 
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5 



Thr Val Pro Cys Gin Lys Arc He Val Tr.r Glu A sr. Lys . hr Asp Glu 

435 44C 445 

Glu Pro Gly Tyr He Lys Lys Cys Cys Lys Gly Phe Cys He Asp He 

4 5 0 4 5 5 4 6 0 



Leu Lys Lys He Ser Lys Ser Val Lys Phe Thr Tyr Asp Leu Tyr Leu 
10 465 470 475 48C 

Val Thr Asn Gly Lys His Gly Lys Lys He Asn Gly Thr Trp Asn Gly 
485 490 495 

15 Met He Gly Glu Val Val Met Lys Arg Ala Tyr Met Ala Val Gly Ser 

500 505 510 

Leu Thr He Asn Glu Glu Arg Ser Glu Val Val Asp Phe Ser Val Pro 
515 520 525 

20 

Phe He Glu Thr Gly He Ser Val Met Val Ser Arg Ser Asn Gly Thr 
530 535 540 

Val Ser Pro Ser Ala Phe Leu Glu Pro Phe Ser Ala Asp Val Trp Val 
25 545 550 555 560 

Met Met Phe Val Met Leu Leu He Val Ser Ala Val Ala Val Phe Val 

565 570 575 

30 Phe Glu Tyr Phe Ser Pro Val Gly Tyr Asn Arg Cys Leu Ala Asp Gly 

5 8 C 5 8 5 5 9 0 

Arg Glu Pro Gly Gly Pro Ser Phe Thr He Gly Lys Ala He Trp Leu 

35 
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Le ^ Trp Gly Leu Val ?he A sr. A sr. Ser 7a 1 Pro Val Glr. A sr. Pro Lys 



Gly Thr Thr Ser Lys He Met Val Ser Val Trp Ala Pre Phe Ala Val 
5 625 630 635 64C 

He Phe Leu Ala Ser Tyr Thr Ala Asr. Leu Ala Ala Phe Met lie Glr. 

645 651 655 

10 Glu Glu Tyr Val Asp Glr. Val Ser Gly Leu Ser Asp Lys Lys Phe Glr. 

660 665 670 

Arg Pro Asn Asp Phe Ser Pro Pro Phe Arg Phe Gly Thr Val Pro Asn 

675 680 685 

15 

Gly Ser Thr Glu Arg Asr. He Arg Asn Asr. Tyr Ala Glu Met His Ala 

690 695 700 

Tyr Met Gly Lys Phe Asn Gin Arg Gly Val Asp Asp Ala Leu Leu Ser 
20 705 710 715 720 

Leu Lys Thr Gly Lys Leu Asp Ala Phe He Tyr Asp Ala Ala Val Leu 
725 730 735 

25 Asn Tyr Met Ala Gly Arg Asp Glu Gly Cys Lys Leu Val Thr He Gly 

740 745 750 

Ser Gly Lys Val Phe Ala Ser Thr Gly Tyr Gly He Ala He Gin Lys 

755 760 765 



30 



Asp Ser Gly Trp Lys Arg Gin Val Asp Leu Ala He Leu Gin Leu Phe 

771 775 781 



Gly Asp Gly Glu Met Glu Glu Leu Glu Ala Leu Trp Leu Lhr Gly He 
35 785 790 795 SCC 
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Cys His Asr. Glu Lys Asr. Glu Val Met Ser Ser Glr. leu Asp He Asp 



5 Asr. Met Ala Gly Val ?he Tyr Met Leu Gly Ala Ala Met Ala Leu Ser 

S 2 : S 2 5 S 3 : 

Leu lie Thr Phe He Cys Glu His Leu ?he Tyr Trp Gin ?he Arg His 
835 84C 845 

10 

Cys Phe Met Gly Val Cys Ser Gly Lys Pro Gly Met Val Phe Ser He 
850 355 360 

Ser Arg Gly He Tyr Ser Cys He His Gly Val Ala He Glu Glu Arg 
15 865 370 875 880 

Gin Ser Val Met Asn Ser Pro Thr Ala Thr Met Asn Asn Thr His Ser 

885 890 895 

20 Asn He Leu Arg Leu Leu Arg Thr Ala Lys Asn Met Ala Asn Leu Ser 

900 905 910 

Gly Val Asn Gly Ser Pro Gin Ser Ala Leu Asp Phe lie Arg Arg Glu 

915 920 925 

25 

Ser Ser Val Tyr Asp lie Ser Glu His Arg Arg Ser Phe Thr His Ser 
930 935 940 

Asp Cys Lys Ser Tyr Asn Asn Pro Pro Cys Glu Glu Asn Leu Phe Ser 
30 945 950 955 960 

Asp Tyr He Ser Glu Val Glu Arg Thr Phe Gly Asn Leu Gin Leu Lys 

G £ c 9^5 



35 Asp Ser Asn Val Tyr Gin Asp His Tyr His His His His Arg Pro His 
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Ser lie Gly Ser Ala Ser Ser He Asp Gly leu Tyr Asp Cys Asp Asr. 

995 ; ■: : c 1005 

5 

Pro Pro Phe Thr Thr Gin Ser Arg Ser He Ser Lys Lys Pro Leu Asp 
1C10 1015 1020 

lie Gly Leu Pro Ser Ser Lys His Ser Gin Leu Ser Asp Leu Tyr Gly 
10 1025 1030 1035 1040 

Lys Phe Ser Phe Lys Ser Asp Arg Tyr Ser Giy His Asp Asp Leu lie 
1045 1050 1C55 

15 Arg Ser Asp Vai Ser Asp He Ser Thr His Thr Val Thr Tyr Gly Asn 

1060 1065 1070 

He Glu Gly Asn Ala Ala Lys Arg Arg Lys Gin Gin Tyr Lys Asp Ser 
1075 1080 1085 

20 

Leu Lys Lys Arg Pro Ala Ser Ala Lys Ser Arg Arg Glu Phe Asp Glu 
1090 1095 1100 

He Glu Leu Ala Tyr Arg Arg Arg Pro Pro Arg Ser Pro Asp His Lys 
25 1105 1110 1115 1120 

Arg Tyr Phe Arg Asp Lys Glu Gly Leu Arg Asp Phe Tyr Leu Asp Gin 
1125 1130 1135 

30 Phe Arg Thr Lys Glu Asn Ser Pro His Trp Glu His Val Asp Leu Thr 
1140 1145 1150 



35 



Asp He Tyr Lys Glu Arg Ser Asp Asp Phe Lys Arc Asp Ser He Ser 
, „ c j. - - £ - 116 5 
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Gly Gly Gly Pro Cys Thr Asr. Arc Ser His He Lys His Gly rhr Gly 



Asp Lys His Gly Val Val Ser Gly Val Pro Ala Pro Trp Glu lys Asr. 
5 1155 119C 1195 121 1 

Leu Thr Asr. Val Glu Trp Glu Asp Arg Ser Gly Gly Asn Phe Cys Arg 

1205 1210 1215 

10 Ser Cys Pro Ser Lys Leu His Asn Tyr Ser Thr Thr Val Thr Gly Gin 

1220 1225 1230 

Asn Ser Gly Arg Gin Ala Cys He Arg Cys Glu Ala Cys Lys Lys Ala 
1235 1240 1245 

15 

Gly Asn Leu Tyr Asp He Ser Glu Asp Asn Ser Leu Gin Glu Leu Asp 
1250 1255 1260 

Gin Pro Ala Ala Pro Val Ala Val Thr Ser Asn Ala Ser Thr Thr Lys 
20 1265 1270 1275 1280 

Tyr Pro Gin Ser Pro Thr Asn Ser Lys Ala Gin Lys Lys Asn Arg Asn 
1285 1290 1295 

25 Lys Leu Arg Arg Gin His Ser Tyr Asp Thr Phe Val Asp Leu Gin Lys 

1300 1305 1310 

Glu Glu Ala Ala Leu Ala Pro Arg Ser Val Ser Leu Lys Asp Lys Gly 
1315 1320 1325 



30 



Arg Phe Met Asp Gly Ser Pro Tyr Ala His Mel Phe Glu Met Ser Ala 

133 0 1335 134 0 



35 



Gly Glu Ser Thr Phe Ala Asn Asr. Lys Ser Ser Val Pro Ihr Ala Gly 
1345 1351 1355 1361 
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ls His His His Asr. Asn Pro Gly Gly Gly Tyr Me: Leu Ser Lys Ser 



5 Leu Tyr Pro Asp Arg Val Thr Gin Asr Pro Pre He Pro Thr Phe Gly 
138C 13S5 139C 

Asp Asp Gin Cys Leu Leu His Gly Ser Lys Ser Tyr Phe Phe Arg Gin 
1 395 1 4 1" 1 1415 

10 

Pro Thr Val Ala Gly Ala Ser Lys Ala Arg Pro Asp Phe Arg Ala Leu 
1410 1415 1420 

Val Thr Asn Lys Pro Val Val Ser Ala Leu His Gly Ala Val Pro Ala 
15 1425 1430 1435 144C 

Arg Phe Gin Lys Asp He Cys He Gly Asn Gin Ser Asn Pro Cys Val 
1445 1450 1455 

20 Pro Asn Asn Thr Asn Pro Arg Ala Phe Asn Gly Ser Ser Asn Gly His 
1460 1465 1470 

Val Tyr Glu Lys Leu Ser Ser lie Glu Ser Asp Val 
1475 1480 

25 

(2) INFORMATION FOR SEQ ID NO : 1 5 : 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 4 6 95 base pairs 

(B) TYPE: nucleic acid 

( C ) S T RAND E ONE S S : both 

(D) TOPOLOGY: both 

35 (iii MOLECULE TYPE: cDNA 
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(:x.l FEATURE : 

(A) NAME KEY: CDS 

(3) LOCATION: 455. .4495 

5 

(xi) S EQLEXC E DE SC R 1 PTT OX : 3 EQ ID XO : 1 5 : 
CGAGAACACA GCGAGTGTGT GAGTCCCTCC CGCTCCAGCT CCTCCAAGCC GCGGCCGCCG 6 3 

10 CCGCCACCCT CGCCCGCAGC CTCCCGCAGC CTCCCTCGGC CACCGGTGTC TGG'TGGGGG^ 120 

GTTGOCTGGG TAGGTCGGCC CGGCCCCCAG GGGTCTCTCG AGCGTCTGCC ATCTGCCCGA 180 

GAAACATGTG TGGCCACGTC CTCGCCTAGT CCAGGTGGCC GCAACCTTGG GGGAGAGACA 24 0 

15 

GGGCAGGACA GGACCAAGGT AAGAGGTAAG GAG GAGACGG CGCCAGGGAC AGACAGGAGG 3 00 

TCCCGGCTTG CCGTTGTGCG CACCACCACT GCCGCCGCCC CGGGGCCTGC CCCCGACATC 3 6 0 

20 GGCTCTCTGA GCCCTCCTCG GAATCTTGGG GTCGCTGGAC GCCGGGTTCC GGTCCTGGCC 4 2 0 

CCCCCGCCAT CCCCCCAACA GAACAGGGTC ATGAAAAGAG GCCGCCCGGC GGGGCCCGCA 4 80 

GGCG ATG CGC GGC GCC GGT GGC CCC CGC GGC OCT CGG GGC CCC OCT AAG 52 9 
25 Met Arg Giy Ala Gly Gly Pro Arg GLy Pro Arg Gly Pro Ala 2ys 
15 10 15 



ATG CTG CTG CTG CTG GCG CTG GCC TGC GCC AGC CCG TTC CCG GAG GAG 57 7 

Met Leu Leu Leu Leu Ala Leu Ala Cys Ala Ser Pro Phe Pro Glu Glu 
30 20 25 3 0 



GCG CCG GGG COG GGC GGG GCC GGT GGG CCC GOO 0 00 GGC 0TC GGC GGG 

Ala Pro Oly PrD Gly Oly Ala Gly Gly Pro Gly Gly Gly Leu Gly Oly 
35 41 45 

35 
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A ^ a A r a 



3 ro Le - Asr. Val A 1 



CTC GTG TTC TCG GGG CCC GCG 



Phe Se: Gly Pro Ala Tyr Ala 



GCG GAG GCG GCA CGC CTG GGC GCG GCG GTG GCG GCG GCG GTG GGG AGC 
Ala Glu Ala Ala Arg Leu GLy Pro Ala Val Ala Ala Ala Val Arg 5er 



65 



7 : 



jGC jT A G A 7 GTG GGG CCl 



AC 



10 Pro Gly Leo Asp Val Arg Pro Val Ala Leo Val Leo Asr. GLy 3er Asp 

3 1 85 9L 95 



15 



CCG CGC AGC CTC GTG CTG CAG CTC TGC GAC CTG CTG TCG GGG TTG CGC 
Pro Arg Ser Leu Val Leu Gin Leu Cys Asp Leu Leu Ser Gly Leu Arg 



100 



105 



113 



817 



20 



3TC TTC GAA GAC GAC TCG 



Va 1 His 



Gly Val Val 
115 



3TC GCG 



Phe Glu Asp Asp Ser Arg Ala Pro Ala Val Ala 



12 j 



125 



CCC ATC CTC GAC TTC CTG TCG GCG CAG ACC TCG CTC CCC ATC GTG TCC 
Pro He Leu Asp Phe Leu Ser Ala Gin Thr Ser Leu Pro He Val Ser 



133 



135 



143 



865 



913 



25 GAG CAC GGC GGC GCC GCG CTC GTG CTC ACG CCC AAG GA 3 AAG GGC TCC 

Glu His Gly Gly Ala Ala Leu Val Leu Thr Pr 3 Lys Glu Lys Gly Ser 



145 



150 



155 



961 



ACC TTC 



CTG GGC TCT TCC CCC GAG GAA CAG CTT CAG GTG ATC 



30 Thr Phe Leu His Leu Gly Ser Ser Pro Glu Gin Gin Leu Gin Val 



17 ; 



175 



1009 



35 



-.G GAG TAT G AjC - GG ACG «. C Z . - . 
Phe Glu Val Leu Glu Glu Tyr Asp Trp Thr Ser Phe 



i ^ «, ^ a . a - 
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5 



ACT CGT GCC CCT GGC CAC CGG GCC TTC CTG TCC 7 AC ATT GAG GTG CTG 

Thr Arg Ala Pro Gly His Arg Ala Phe Leu Ser Tyr He Glu Val Leu 

195 200 205 

ACT GAC GGC AGT CTG GTG GGC TGG GAG CAC CGC GGA CCG CTG ACG CTG 

Thr Asp G ly Ser leu Val Gly Trp Glu His Arg Gly Ala Leu Thr Leu 

21j 215 22D 



10 GAC CCT GGG GCG GGC GAG GCC GTG CTC AGT GCC CAG CTC CGC AGT GTC 12 01 

Asp Pro Gly Ala Gly Glu Ala Val Leu Ser Ala Gin Leu Arg Ser Val 

225 233 235 

AGC GCG CAG ATC CGC CTG CTC TTC TGC GCC CGA GAG GAG GCC GAG CCC 124 9 

15 Ser Ala Gin lie Arg Leu Leu Phe Cys Ala Arg Glu Glu Ala Glu Pro 
240 245 250 255 

GTG TTC CGC GCA GCT GAG GAG GCT GGC CTC ACT GGA TCT GGC TAC GTC 12 97 

Val Phe Arg Ala Ala Glu Gli Ala Gly Leu Thr Gly Ser Gly Tyr Val 
20 263 265 273 

TGG TTC ATG GTG GGG CCC CAG CTG GCT 3GA GGC GGG GGC TCT GGG GCC 134 5 

Trp Phe Met Val Gly Pro Gin Leu Ala Gly Gly Gly Gly Ser Gly Ala 
275 230 235 

25 

CCT GGT GAG CCC CCT CTT CTG CCA GGA GGC GCC CCC CTG CCT GCC GGG 13 93 

Pro Gly Glu Pro Pro Lei Leu Pro Gly Gly Ala Pro Lei Pro Ala Gly 

2 9 0 2 9 5 3 0 3 

30 CTG TTT GCA GTG CGC TCG GCT GGC TGG CGG GAT GAC CTG GCT CGG CGA 14 41 

Leu Phe Ala Val Arg Ser Ala Gly Trp Arg Asp Asp Lei Ala Arg Arg 



G.G 3LA 3 0 I GGC GTG GCT G.A G.G GCC AGA GG . GCC 0AG G0C C.G J - [ _ 
35 Val Ala Ala Gly Val Ala Val Val Ala Arc Gly Ala Gin Ala Leu Lei 
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# 



CGT GAT TAT GGT TTC CTT CCT GAG CTC GGC CAC GAC TGT CGC GCC CAG 
Arg Asp Tyr Gly ?he Leu Pro Glu leu Gly His Asp Cys Arc Ala Gin 
5 3 4 0 3 4 5 3 5 0 



:37 



10 



AAG CGC ACC OAC CGC GGG GAG AGT CTG CAT AGG TAC TTC ATG AAC ATC 15S5 

Asn Arg Thr His Arg Gly Glu Ser Leu His Arg Tyr Phe Met Asn lie 
3 5 5 3 6 D 3 6 5 

ACG TGG GAT AAC CGG GAT TAC TCC TTC AAT GAG GAC GGC TTC CTA GTG 163 3 

Thr Trp Asp Asn Arg Asp Tyr Ser Phe Asn Glu Asp Gly Phe Leu Val 
370 375 3S0 



15 AAC CCC TCC CTG GTG GTC ATC TCC CTC ACC AG A GAC AGG ACG TGG GAG 

Asn Pro Ser Leu Val Val lie Ser Leu Thr Arg Asp Arg Thr Trp Glu 
335 390 395 



1681 



GTG GTG GGC AGC TGG GAG CAG CAG ACG CTC CGC CTC AAG TAC CCG CTG 
20 Val Val Gly Ser Trp Glu Gin Gin Thr Leu Arg Leu Lys Tyr Pro Leu 

400 4 05 4 10 415 



1729 



25 



TGG TCC CGC TAT GGT CGC TTC CTG CAG CCA GTG GAC GAC ACG CAG CAC 
Trp Ser Arg Tyr Gly Arg Phe Leu Gin Pro Val Asp Asp Thr Gin His 
420 425 430 



1777 



30 



CTC GCG GTG GCC ACG CTG GAG GAA AGG CCG TTT GTC ATC GTG GAG CCT 182 5 

Leu Ala Val Ala Thr Leu Glu Glu Arg Pro Phe Val lie Val Glu Pro 

435 44 0 445 

GCA GAC CCT ATC AGC GGC ACC TGC ATC CGA GAC TCC GTC CCC TGC CGG 187 3 

Ala Asp PrD He Ser Gly Thr Cys He Arg Asp Ser Val Pro Cys Arg 

451 455 461 



35 



AGC CAG CTO 



cc c- 



\GC CCT CCA 0C3 GAT GCC CCC 0GC CC 3 
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Ser Gin Leu A sr. Arc Thr His Ser Pro Pro Pro Asp Ala Pro Arc Pro 



GAA AAG CGC TGC TGC AAG GGT TTC TGC ATC GAG ATT CTG AAG CGG GTG 
Glu Lys Arg Cys Cys Lys Gly Phe Cys He Asp He leu Lys Arg leu 
4 8 0 485 49 0 495 



1969 



10 



GCG CAT ACC ATC GGC TTC AGC TAC GAC CTC TAC CTG GTC ACC AAT GGC 
Ala His Thr He Gly Phe Ser Tyr Asp Leu Tyr leu Val Thr Asn Gly 
503 505 510 



2017 



15 



AAG CAC GGA AAG AAG ATC GAT GGC GTC TGG AAC GGC ATG ATC GGG GAG 

Lys His Gly Lys Lys He Asp Gly Val Trp Asn Gly Met He Gly Glu 

515 520 525 

GTG TTC TAC CAG CGC GCA GAC ATG GCC ATC GGC TCC CTC ACC ATC AAC 

Val Phe Tyr Gin Arg Ala Asp Met Ala He Gly Ser Leu Thr He Asn 

530 535 540 



2065 



■113 



20 GAG GAG CGC TCC GAG ATC GTG GAC TTC TCC GTC CCC TTC GTG GAG ACC 
Glu Glu Arg Ser Glu He Val Asp Phe Ser Val Pro Phe Val Glu Thr 
545 550 555 



1.161 



GGC ATC AGC GTC ATG GTG GCG CGC AGC AAT GGC ACG GTG TCC CCC TCG 
25 Gly He Ser Val Met Val Ala Arg Ser Asn Gl/ Thr Val Ser Pro Ser 

560 565 57D 575 



•209 



30 



GCC TTC CTC GAG CCC TAC AGC CCC GCC GTG TGG GTG ATG ATG TTC GTC 
Ala Phe Leu Glu Pro Tyr Ser Pro Ala Val Trp Val Met Met Phe Val 

580 585 590 



2257 



35 



ATG TGC CTC ACT GTG GTC GCC GTC ACT GTT TTC ATC TTC GAG TAC CTC 
Met Cys Leu Thr Val Val Ala Val Thr Val Phe He Phe Glu Tyr Leu 
595 611 615 
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AGT CCT GTT GGT TAG AAC GGG AG C CTG GCC ACG GGC AAG CGG CCT GGG 
Ser Pro Val Gly Tyr Asn Arg Ser Leu Ala Thr Gly Lys Arc Pro Gl y 



5 GGT TCA ACG TTC ACG ATT GGG AAA TCC ATC TGG CTG CTC TGG GCC CTG 2 4 01 

Gly Ser Thr Phe Thr lie Gly Lys Ser He Trp leu Leu Trp Ala le; 

625 63: 635 

GTG TTC AAT AAT TCG GTG CCC GTG GAG AAC CCC CGG GGA ACC ACC AGT 2 44 9 

10 Val Phe Asn Asn Ser Val Pro Val Glu Asn Pro Arg Gly Thr Thr Ser 

64 0 645 65 J 655 

AAA ATC ATG GTG CTG GTG TGG GCC TTC TTC GCC GTC ATC TTC CTC GCC 2 4 97 

Lys He Met Val Leu Val Trp Ala Phe Phe Ala Val He Phe Lea Ala 

15 660 665 670 

AGC TAC AC A GCC AAC CTG GCC GCC TTC ATG ATC GAG GAG GAG TAG GTG 2 54 5 

Ser Tyr Thr Ala Asn Leu Ala Ala Phe Met He Gin Glu Glu Tyr Val 

675 68D 685 

20 

GAT ACT GTG TCT GGG CTC AGT GAG CGC AAG TTC GAG AGG CCC GAG GAG 2 593 

Asp Thr Val Ser Gly Leu Ser Asp Arg Lys Phe Gin Arg Pro Gin Glu 

690 695 70 0 

25 CAG TAC CCG CCC CTG AAG TTT GGG ACC GTG CCC AAC GGG TCC ACG GAG 2 641 

Gin Tyr Pro Pro Leu Lys Phe Gly Thr Val Pro Asn Gly Ser Thr Gl a 

705 710 715 

AAG AAC ATC CGC AGC AAC TAT CCC GAC ATG CAC AGC TAC ATG GTG CGC 2 6 89 

30 Lys Asn He Arg Ser Asn Tyr Pro Asp Me: His Ser Tyr Met Val Ar g 

720 725 73C 735 

TAC AAC CAG CCC CG0 GTA 3AG GAA GCG CTC A 07 0AG CTC AAG G OA GGG 273 7 

Tyr Asn Gin Pro Arg Val Olu Glu Ala Leu Thr Gin Leu Lys Ala Gly 

35 74: 745 75C 
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GAC GC 



„'/5 ^eu nSD 



GOT GCA 



Phe He Tyr Asp A. a A^a Va - 



CTC AA7 TAG ATG GCC 
leu Asr. Tyr Me: Ala 



CGG AAG GA: GAG GGG TGC AAG ' 
Arg Lys Asp Glu Gly Gys Lys Leu Va L 

^7: 775 



GTC ACG ATG GGG TGG GGG AAG GTC 
r lie Gly Ser Gly Lys Va 1 

7 3C 



2S33 



10 TTG GCC ACG ACA GGC TAT GGC ATG GGG CTG GAG AAG GGC TCC GGG TGG 
Phe Ala Thr Thr Gly Tyr Gly lie Ala Leu His Lys Gly Ser Arg Trp 



735 



790 



795 



2881 



AAG CGG CCG ATG GAC CTG GCG TTG CTG GAG TTG CTG GGG GAT GAT GAG 
15 Lys Arg Pro lie Asp Leu Ala Leu Leu Gin Phe Leu Gly Asp Asp Glu 

833 335 813 815 



;:929 



20 



ATC GAG ATG CTG GAG CGG CTG TGG CTC TCT GGG ATC TGC CAC AAT GAG 
He Glu Met Leu Glu Arg Leu Trp Leu Ser Gly He Cys His Asr. As:: 



323 



•325 



33 3 



i:9?7 



25 



AAA ATC GAG GTG ATG AGC AGC AAG CTG GAC ATC GAC AAG ATG GCG GG : 
Lys He Glu Val Met Ser Ser Lys Leu Asp He Asp Asn Met Ala Gly 



835 



34 3 



34 5 



GTC TTC TAG ATG CTC CTG GTG GCC ATG GGC CTG TCC CTG CTG GTC TTG 
Val Phe Tyr Met Leu Leu Val Ala Met Gly Leu Ser Leu Leu Val Phe 

353 355 363 



3 02 5 



3 073 



30 GGC TGG GAG CAC CTG GTG TAG TG 1 
Ala Trp Glu Hi 



36 D 



IC GTG GGG GAG TGC CTG GGG CC ? 
Leu Val Tyr Trp Arg Leu Arg His Cys Leu Gly Pro 

3 7 3 3 7 5 



3121 



35 



cag gg: 



GAG TTC GTG GTG GGC 



"C TCC AGG GGG A.G TAG AG 3 



3hr His Arg Met Asp Phe Leu Leu Ala Phe Ser Arc Gly Met - yr Ser 
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10 



TGC TGC AGC OCT GAG GGC GCC GCA CCG CCC GCC AAG CCC CCG CCG CCG 
Cys Cys Ser Ala Glu Ala Ala Pro Pro Pro Ala Lys Pro Pro Pro Pro 



CCA CAG CCC CTG CCC AGC CCC GCG TAC CCC GCG CCG GGG CCG GCT CCC 3 2 65 

Pro Gin Pro Leo Pro Ser Pro Ala Tyr Pro Ala Pro Gly Pro Ala Pro 

915 921 925 

jGG CCC GCA CCT TTC GTC CCC CGC GAG CGC GCC TCA GTG GCC CGC TGG 3 313 

Gly Pro Ala Pro Phe Val Pro Arg Glu Arg Ala Ser VaL Ala Arg Trp 

93 D 335 940 



15 CGC CGG CCC AAG GGC GCG GGG CCG CCG GGG GGC GCG GGC CTG GCC GAC 3361 

Arg Arg Pro Lys Gly Ala Gly Pro Pro Gly Gly Ala Gly Leu Ala Asp 
945 95 3 955 

GGC TTC CAC CGC TAC TAC GGC CCC ATC GAG CCG CAG GGC CTA GGC CTC 3 4 09 

20 Gly Phe His Arg Tyr Tyr Gly Pro lie Glu Pro Gin Gly Leu Gly Leu 

960 965 97D 975 

GGC CTG GGC GAA GCG CGC GCG GCA CCG CGG GGC GCA GCC GGG CGC CCG 3 4 57 

Gly Leu Gly Glu Ala Arg Ala Ala Pro Arg Gly Ala Ala Gly Arg Pro 

25 980 935 993 

CTG TCC CCG CCG GCC GCT CAG CCC CCG CAG AAG CCG CCG GCC TCC TAT 3 5 05 

Leu Ser Pro Pro Ala Ala Gin Pro Pro Gin Lys PrD Pro Ala Ser Tyr 

995 1 D DC 1 D35 



30 



ATC GTA CGC GAC AAG GAG CCA GCC GAG CCC CCC GCC GGC GCC 3 55 3 

lie Val Arg Asp Lys Glu ?.u Ala Glu Pr:> Pro Ala Gly Ala 



35 TTC CDC GGC TTC CCG TCC DCG CCC GCG DC 0 CCC GCC GCC 
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Phe 



Ser Pro Pro 



Pro ?r<: 



GCC GTC GGG CCG CCA CTC TGC CGC TTG GCC TTC GAG GAC GAG AGC CCG 3 64? 

5 Ala Val Gly Pro ?r: Leu Cys Arg Leu Ala Phe Glu Asp Glu Ser ?rj 

1041 1045 1C5C 1055 

CCG GCG CCC GCG CGG TGG CCG C 3C TCG GAC CCC GAG AGC CAA CCC CTG 3 6 97 

Pr: Ala Pro Ala Arg Trp Pro Arg Ser Asp ?r d Glu Ser Gin Pro Le_i 

10 105C 1065 1370 



CTG GGG CCA GGC GCG GGC GGC GCG GGG GGC ACG GGG GGC GCA GGC GGA 3 74 5 

Leu Gly Pro Gly Ala Gly Gly Ala Gly Gly Thr Gly Gly Ala Gly Gly 

1375 1030 1035 

GGA GCC CCG GCC GOT CCG CCC CCG TGC TTC GCC GCG CCG CCC CCG TGC 3 7 93 

Gly Ala Pro Ala Ala Pro Pro Pro Cys Phe Ala Ala Pro Pro Pro Cys 

1090 1095 1100 



15 



20 TTT TAG CTC GAT GTC GAC CAG TCG CCG TCG GAC TCG GAG GAC TCG GAG 3 841 

Phe Tyr Leu Asp Val Asp Gin Ser Pro Ser Asp Ser Glu Asp Ser GIj 
1105 1110 1115 

AGC CTG GCC GGC GCG TCC CTG GCC GGC CTG GAT CCC TGG TGG TTC GCC 3 889 

25 Ser Leu Ala Gly Ala Ser Leu Ala Gly Leu Asp Pro Trp Trp Phe Ala 
1120 1125 1130 1135 

GAC TTC CCT TAG CCG TAT GCC GAT CGC CTC GGG CSG CCC GCG GCA CGC 3 937 

Asp Phe Pro Tyr Pro Tyr Ala Asp Arg Leu Gl / Xaa Pro Ala Ala Arg 

30 1140 1145 1151 

TA: GGA TTG GTC GA 0 AAA CTA GGG GGC TGG CT : GCC 3GG AGO TGG GA0 3 9 55 

Tyr Gly Leu Val Asp Lys Leu Gly Gly Trp Leu Ala 31 y Ser Trp Asp 



35 
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TAC CTG CCT CCS CGC 
Tyr Leu Pro Xaa Arc 



AGC GG - CGG GCC GCC * GG C.—.C . G . 
Ser Gly Arc Ala Ala Trp His Cys 



Arg His Cys 



GCC AGC CTG GAG CTG CTT CCG CCG CCG CGC CAT CTC AGC TGC TCG CAC 
Ala Ser Leu Glu Leu Leu Pro PrD Pro Arg His Leu Ser Cys Ser His 
1135 1190 1195 



4C31 



10 



GAT GGC CTG GAC GGC GGC TGG TGG GCG CCA CCG CCT CCA CCC TGG GCC 
Asp Gly Leu Asp Gly Gly Trp Trp Ala Pro Pro Pro Pro Pro Trp Ala 



1200 



1205 



1210 



1215 



15 



GCC GGG CCC CTG CCC CGA CGC CC- 



5CC CGC TGC GGG TGC CCG CGG TCG 



Ala Gly Pro Leu Pro Arg Arg Arg Ala Arg Cys Gly Cys Pro Arg Ser 



1220 



1225 



1230 



4177 



20 



2 AC CCG CAC CGC CCG CGG GCC TC 



:ac cgc ac: 



jCT GCC G 



His Pro His Arg Pro Arg Ala Ser His Arg Thr Pro Ala Ala Ala Ala 



1235 



1240 



1245 



CCC CAC CAC CAC AGG CAC CGG CGC GCC GCT GGG GGC TGG GAC CTC CCG 
Pro His His His Arg His Arg Arg Ala Ala Gly Gly Trp Asp Leu Pro 
1250 1255 1260 



4225 



4273 



25 CCG CCC GCG CCC ACC TCG CGC TCG CTC GAG GAC CTC AGC TCG TGC CCT 
Pro Pro Ala Pro Thr Ser Arg Ser Leu Glu Asp Leu Ser Ser Cys Pro 



1265 



1270 



1275 



4321 



30 



CGC GCC GCC CCT GCG CGC AGG CTT AC 



JGG CCC TCC CGC CAC GCT CGC 



Arg Ala Ala Pro Ala Arg Arg Leu Thr Gly Pro Ser Arg His Ala Arg 



1235 



123C 



1295 



4369 



35 



AGG TGT CCG CAC CCC GCG CAC TGG GGG CCG CCG :TG CCT ACA GCT TC Z 
Arg Cys Pro His Ala Ala His Trp Gly Pro Pro Leu Pro Thr Ala Ser 



4417 
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CAC CGG AG A CAC CGG GGC GGG GAG CTG GGC ACG CGC AGG GGC TCG GGG 446 5 

His Arg Arc His Arg Gly Gly Asp Leu Gly Thr Arc Arg Gly Ser Ala 
1315 132: 1325 

5 

GAC TTC TGT AGC CTG GAG TCG GAG GTA TGACGCGGCG CCGGGGGCCC 4 512 

His Phe Ser Ser Leu Glu Ser Glu Va 1 
1330 1335 

10 CACCGCCCCC TTGGTCAGCG CAGGCCACGG CCCGAGGGGG CGCCGGCAGT GGACAGGACC 4 572 

CGCGTGGGTT GGG AAGG AAA GCAGTGGAAC TGGCCGGAGC CCGCGTGGAG CAGCGTCCTG 4 63 2 

CGCCCCCTGG TTCTGGAGGA ACCGCAAGCC GGAGAGGATT TGGTCCCTCA ACTATCACCC 4 6 92 

15 

AGG 4695 



(2) INFORMATION FOR SEQ ID NO: 16: 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1336 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

25 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

30 Met Arg Gly Ala Gly Gly Pro Arg Gly Pro Arg Gly Pro Ala Lys Met 

15 1C 15 

Leu Leu Leu Leu Ala Leu Ala :ys Ala Ser Pro Phe Pro Glu Glu Ala 

2 1 2 5 3 1 

35 
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Sly Ala Gly Gly 



Leu Gly 



Arg Pro Leu Asn 

5 5: 

Glu Ala Ala Arg 

65 

10 Gly Leu Asp Val 

Arg Ser Leu Val 

100 

15 

His Gly Val Val 
115 

lie Leu Asp Phe 

20 i3o 

His Gly Gly Ala 
145 

25 Phe Leu His Leu 

Glu Val Leu Glu 
18C 

30 

Arg Ala Pro Gly 

Asp Gly Ser Leu 

35 2:: 



Val Ala Leu Val Phe Ser 
5 5 

Leu Gly Pro Ala Val Ala 

7 0 

Arg Pro Val Ala Leu Val 

85 90 

Leu Gin Leu Cys Asp Leu 
105 

Phe Glu Asp Asp Ser Arg 

120 

Leu Ser Ala Gin Thr Ser 
13 5 

Ala Leu Val Leu Thr Pro 
150 

Gly Ser Ser Pro Glu Gin 

165 170 

Glu Tyr Asp Trp Thr Ser 

185 

His Arg Ala Phe Leu Ser 

2 0C 

Val Gly Trp Glu His Arg 



Gly Pro Ala Tyr Ala Ala 

Ala Ala Val Arg Ser Pro 

75 80 

Leu Asn Gly Ser Asp Pro 
95 

Leu Ser Gly Leu Arg Val 

110 

Ala Pro Ala Val Ala Pro 
125 

Leu Pro lie Val Ser Glu 
140 

Lys Glu Lys Gly Ser Thr 
15 5 16 0 

Gin Leu Gin Val lie Phe 
175 

Phe Val Ala Val Thr Thr 
190 

Tyr lie Glu Val Leu Thr 

2:5 

Gly Ala Leu Thr Leu Asp 
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Pro Gly Ala Gly Glu Ala Val Leu Ser a: 



Leu Arc Ser Val Ser 



5 Ala Gin lie Arg Leu Leu Phe Cys Ala Arg Glu Glu Ala Glu Pro Val 
245 251 255 

Phe Arg Ala Ala Glu Glu Ala Gly Leu Thr Gly Ser Gly Tyr Val Trp 

26C 265 270 

10 

Phe Met Val Gly Pro Gin Leu Ala Gly Gly Gly Gly Ser Gly Ala Pro 
275 280 285 

Gly Glu Pro Pro Leu Leu Pro Gly Gly Ala Pro Leu Pro Ala Gly Leu 
15 290 295 300 

Phe Ala Val Arg Ser Ala Gly Trp Arg Asp Asp Leu Ala Arg Arg Val 

305 310 315 320 

20 Ala Ala Gly Val Ala Val Val Ala Arg Gly Ala Gin Ala Leu Leu Arg 

325 330 335 

Asp Tyr Gly Phe Leu Pro Glu Leu Gly His Asp Gys Arg Ala Gin Asn 
340 345 350 

25 

Arg Thr His Arg Gly Glu Ser Leu His Arg Tyr Phe Met Asn lie Thr 
355 360 365 

Trp Asp Asn Arg Asp Tyr Ser Phe Asn Glu Asp Gly Phe Leu Val Asn 
30 3 7C 37 5 3 50 

Pro Ser Leu Val Val He Ser Leu Thr Arg Asp Arg Thr Trp Glu Val 

385 390 395 4CC 



35 Val Gly Ser Trp Glu Gin Gin Thr Leu Arg Leu Lys Tyr Pro Leu Trp 
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Ser Arc Tyr Gly Arg ?he Leu Gin Pro Val Asp Asp rhr Gin His Leu 
42: 425 432 

5 

Ala Val Ala Thr Leu Glu Glu Arg Pro Phe Val lie Val Glu Pro Ala 
435 44 0 445 

Asp Pro He Ser Gly Thr Cys He Arg Asp Ser Val Pro Cys Arg Ser 
10 45C 455 46 0 

Gin Leu Asn Arg Thr His Ser Pro Pro Pro Asp Ala Pro Arg Pro Glu 
465 470 475 480 

15 Lys Arg Cys Cys Lys Gly Phe Cys He Asp He Leu Lys Arg Leu Ala 
485 490 495 

His Thr He Gly Phe Ser Tyr Asp Leu Tyr Leu Val Thr Asn Gly Lys 

500 505 510 

20 

His Gly Lys Lys He Asp Gly Val Trp Asn Gly Met He Gly Glu Val 
515 520 525 

Phe Tyr Gin Arg Ala Asp Met Ala He Gly Ser Leu Thr He Asn Glu 
25 530 535 540 

Glu Arg Ser Glu He Val Asp Phe Ser Val Pro Phe Val Glu Thr Gly 
545 550 555 560 

30 He Ser Val Met Val Ala Arg Ser Asn Gly Thr Val Ser Pro Ser Ala 

565 570 575 



35 



Phe Leu Glu Pro Tyr Ser Pro Ala Val Trp Val Met Met Phe Val Met 
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Pro Val Gly Tyr A sr. Arg Ser Leu Ala Thr Gly Lys Arg Pro Gly Gly 
5 61 Z 615 621 

Ser Thr Phe Thr lie Gly Lys Ser lie Trp Leu Leu Trp Ala Leu Val 

625 63G 635 64C 

10 Phe Asn Asn Ser Val Pro Val Glu Asn Pro Arg Gly Thr Thr Ser Lys 

645 650 655 

lie Met Val Leu Val Trp Ala Phe Phe Ala Val lie Phe Leu Ala Ser 
660 665 670 

15 

Tyr Thr Ala Asn Leu Ala Ala Phe Met lie Gin Glu Glu Tyr Val Asp 
675 680 685 

Thr Val Ser Gly Leu Ser Asp Arg Lys Phe Gin Arg Pro Gin Glu Gin 
20 690 695 700 

Tyr Pro Pro Leu Lys Phe Gly Thr Val Pro Asn Gly Ser Thr Glu Lys 
705 710 715 720 

25 Asn lie Arg Ser Asn Tyr Pro Asp Met His Ser Tyr Met Val Arg Tyr 

725 73 0 735 

Asn Gin Pro Arg Val Glu Glu Ala Leu Thr Gin Leu Lys Ala Gly Lys 

74 0 745 75 0 

30 

Leu Asp Ala Phe lie Tyr Asp Ala Ala Val Leu Asn Tyr Met Ala Arg 

75 5 761 76 5 

Lys Asp Glu Gly Cys Lys Leu Val Thr He Gly Ser Gly Lys Val Phe 
35 771 775 78 C 
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Ala Thr Thr Gly Tyr Gly He Ala Leu His Lys Gly Ser Arg . rp Lys 

785 79 0 795 82 2 

5 Arg Pro lie Asp Leu Ala Leu Leu Gin ?he Leu Gly Asp Asp Glu lie 
805 310 815 

Glu Met Leu Glu Arg Leu Trp Leu Ser Gly He Cys His A sr. Asp Lys 

320 325 830 

10 

He Glu Val Met Ser Ser Lys Leu Asp He Asp Asn Met Ala Gly Val 
835 840 845 

Phe Tyr Met Leu Leu Val Ala Met Gly Leu Ser Leu Leu Val Phe Ala 
15 850 855 360 

Trp Glu His Leu Val Tyr Trp Arg Leu Arg His Cys Leu Gly Pro Thr 

865 37 0 875 88 0 

20 His Arg Met Asp Phe Leu Leu Ala Phe Ser Arg Gly Met Tyr Ser Cys 

885 390 895 

Cys Ser Ala Glu Ala Ala Pro Pro Pro Ala Lys Pro Pro Pro Pro Pro 

9 0 0 9 0 5 910 

25 

Gin Pro Leu Pro Ser Pro Ala Tyr Pro Ala Pro Gly Pro Ala Pro Gly 

915 920 925 

Pro Ala Pro Phe Val Pro Arg Glu Arg Ala Ser Val Ala Arg Trp Arg 
30 930 935 940 

Arg Pro Lys Gly Ala Gly Pro Pro Gly Gly Ala Gly Leu Ala Asp Gly 
35 Phe His Arg Tyr Tyr Gly Pro He Glu Pro Gin Gly Leu Gly Leu Gly 
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Leu Gly Glu Ala Arc Ala Ala Pro Arc Gly Ala Ala Gly Arc Pro Leu 
?S1 935 991 

5 

Ser Pro Pro Ala Ala Gin Pro Pro Gin Lys Pro Pro Ala Ser Tyr Phe 

995 10C0 1CG5 

Ala lie Val Arg Asp Lys Glu Pro Ala Glu Pro Pro Ala Gly Ala Phe 
10 101 C 1015 102 0 

Pro Gly Phe Pro Ser Pro Pro Ala Pro Pro Ala Ala Ala Ala Thr Ala 
1025 1030 1035 1040 

15 Val Gly Pro Pro Leu Cys Arg Leu Ala Phe Glu Asp Glu Ser Pro Pro 
1045 1050 1055 

Ala Pro Ala Arg Trp Pro Arg Ser Asp Pro Glu Ser Gin Pro Leu Leu 
1060 1065 1070 

20 

Gly Pro Gly Ala Gly Gly Ala Gly Gly Thr Gly Gly Ala Gly Gly Gly 
1075 1080 1085 

Ala Pro Ala Ala Pro Pro Pro Cys Phe Ala Ala Pro Pro Pro Cys Phe 
25 1090 1095 1100 

Tyr Leu Asp Val Asp Gin Ser Pro Ser Asp Ser Glu Asp Ser Glu Ser 
1105 1110 1115 1120 

30 Leu Ala Gly Ala Ser Leu Ala Gly Leu Asp Pro Trp Trp Phe Ala Asp 

1125 1130 1135 

Phe Pro Tyr Pro Tyr Ala Asp Arg Leu Gly Xaa Pro Ala Ala Arg Tyr 
1141 1145 1151 

35 
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Leu Val Asp Lys Leu Gly Gly 



Leu Pro Xaa Arg Ser Gly Arg Ala Ala Trp His Cys Arg His Cys Ala 
5 1170 1175 11SC 

Ser Leu Glu Leu Leu Pro Pro Pro Arg His Leu Ser Cys Ser His Asp 
1185 1190 1195 1200 

10 Gly Leu Asp Gly Gly Trp Trp Ala Pro Pro Pro Pro Pro Trp Ala Ala 
12C5 1210 1215 

Gly Pro Leu Pro Arg Arg Arg Ala Arg Cys Gly Cys Pro Arg Ser His 
1220 1225 1230 

15 

Pro His Arg Pro Arg Ala Ser His Arg Thr Pro Ala Ala Ala Ala Pro 
1235 1240 1245 

His His His Arg His Arg Arg Ala Ala Gly Gly Trp Asp Leu Pro Pro 
20 1250 1255 1260 

Pro Ala Pro Thr Ser Arg Ser Leu Glu Asp Leu Ser Ser Cys Pro Arc 
1265 1270 1275 12&0 

25 Ala Ala Pro Ala Arg Arg Leu Thr Gly Pro Ser Arg His Ala Arg Arg 

1285 1290 1295 

Cys Pro His Ala Ala His Trp Gly Pro Pro Leu Pro Thr Ala Ser His 
1300 1305 1310 

30 

Arg Arg His Arg Gly Gly Asp Leu Gly Thr Arg Arg Gly Ser Ala His 
1315 1321 1325 

Phe Ser Ser Leu Glu Ser Glu Val 
35 1331 1335 
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2) INFORMATION FOR SEQ ID NO : 1 7 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 71 base pairs 
(3) TYPE: nucleic acid 

( C ) STRANDEDNESS : bo t h 

(D) TOPOLOGY: both 

(ll) MOLECULE TYPE: cE>NA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 7 : 
15 GGGTGGCGGC CGCAGAGCAC CTCCACCATC TCCTTGTCCT ACTCCAAGAT CTGGCCCTAG 6 0 

TCCATGTTTG C 71 



10 



20 (2) INFORMATION FOR SEQ ID NO : 1 3 : 

(l) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 71 base pairs 
(3) TYPE: nucleic acid 
25 (C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(li) MOLECULE TYPE : cDNA 

30 (xi ) SEQUENCE DESCRIPTION: SEQ ID NO : 1 8 : 

TGGTGGTCCC CAACCTGTAG GACTTGGTTC TGGAGGAGGA TCTGGTGTAG GCAAACATGG 

ACTAGGGCCA G 

35 
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il; INFORMATION FOR SEQ I D NO : 1 5 : 

(l) SEQUENCE CHARACTERISTICS : 
5 (A) LENGTH: 61 base pairs 

(3) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

( D ) TOPOLOGY: both 

10 (ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



15 



GTTGGGGACC AC C AG ATGG A GGTAGAGCTG CACTTGTACG AAGAGCTCCA CAACCACCTG 6 0 



G 61 



(2) INFORMATION FOR SEQ ID NO : 2 0 : 



20 



(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 
25 (D) TOPOLOGY: both 

(n) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



30 



35 



CGTGAGACGT CAGACAAAGG AGGCCCAGGT GTAGGTGGTC TACCAGGTGG TTGTGGAGCT 6 0 
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;2i INFORMATION FOR SEQ ID NO : 2 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 195 base pairs 
5 (3) TYPE: nucleic acid 

(C) STRANDEDNESS: born 

( D ) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
CCGCAGAGCA CCTCCACCAT CTCCTTGTCC TACTCCAAGA TCTGGCCCTA GTCCATGTTT 6 0 

15 GCCTACACCA GATCCTCCTC CAGAACCAAG TCCTACAGGT TGGGGACCAC CAGATGGAGG 12 0 

TAGAGCTGCA CTTGTACGAA GAGCTCCACA ACCACCTGGT AGACCACCTA CACCTGGGCC 180 
TCCTTTGTCT GACGT 195 

20 
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